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CROSS REFERENCE TO RELATED APPLICATION 
This application claims me benefit of U.S. Provisional Patent Application 60/480,035, filed 
June 19, 2003, which is incorporated herein its entirety. 

FIELD 

This disclosure relates to the field of taste reception, and more particularly to variations in 
taste receptors, such as bitter taste receptors including those in the T2R family. It further relates to 
methods for identifying individuals and populations having certain taste receptor variants and 
identifying compounds that interact with taste receptors, including compounds that interact 
differentially with different variants of a taste receptor. 

BACKGROUND 

Bitter taste is believed to have evolved in order to allow organisms to detect and avoid 
toxins from the environment. The sense of bitter taste is mediated by a group of 24 apparently 
functional bitter taste receptor proteins that reside on the surface of taste cells within the taste buds of 
the tongue. These receptors are 7-transmembrane domain, Gprotein coupled receptors, encoded by 
members of the T2R gene family. In contrast to TIRs, which also belong to the superfamily of G 
protein-coupled receptors and have a large N-tenninal domain, T2R bitter taste receptors generally 
have a short extracellular N terminus. These cell surface receptors interact with tastants and initiate 
signaling cascades that culminate in neurotransmitter release and bitter taste perception. The human 
genome contains 24 apparently functional T2R genes, which reside in three locations. Fourteen 
genes reside in a cluster on chromosome 12pl3, nine genes reside in a cluster on chromosome 7q3 1 
and a single family member resides on chromosome 5pl 5 (Shi, et aL, Mol Biol. Evol. 20:805-814 ' 
2003). These genes all contain a single coding exon (approximately 1 kb in length) that encodes a 
receptor averaging approximately 300 amino acids in length. 

individual members of the T2R family exhibit 30%-70% amino acid identity. The most 
highly conserved sequence motifs reside in the first and last transmembrane segments, and also in the 
second cytoplasmic loop. The most divergent regions are the extracellular segments, extending 
partway into the transmembrane helices, possibly reflecting the need to recognize structurally diverse 
ligands. 

Taste sensitivity to the bitter compound phenyltWocarbamide (PTC) and related chemicals is 
bimodally distributed, and virtually all human populations tested to date contain some people who 
can (tasters) and some people who cannot taste (nontasters) PTC. The frequency of tasters in North 
Americans of European ancestry is about 7 0 o/ 0 . The PTC taste receptor encoded on chromosome 7 
was recently identified as a taste receptor that mediates the bitter taste of at least PTC (Kim et at. 
Science 299:1221-1225, 2003). 



WO 2005/007891 



PCT/US2004/019489 



-2- 



10 



15 



Although PTC itself has not been found in nature, the ability to taste PTC is correlated 
strongly with the ability to taste other naturally occurring bitter substances, many of which are toxic 
(Harris and Kahnus,^„^ gew 15:32-45, 1949; Bamicot et al, Ann Eugen 16:119-128, 19;Tepper 
Am J Hum Genet 63 :1211-1216, 1998). Furthermore, variation in PTC taste sensitivity has been 
correlated with dietary preferences that may have significant health effects (Bartoshuk et al. 1 994). 
For example, PTC is similar in structure to isothiocyanates (compounds containing the group N-C=S) 
and goitrin, both of which are bitter substances found in cruciferous vegetables like cabbage and 
broccoli (Tepper, Am J Hum Genet 63:1271-1276, 1998). Variable aversions to these compounds 
have been implicated in the variable rates of thyroid-deficiency disease in PTC tasters and nontasters, 
with nontasters being more susceptible (Drewnowski and Rock, Am J Clin Nutr 62:506-51 1, 1995). 

Identifying receptor-ligand relationships for T2Rs has been difficult, and the nature' of the 
ligand that binds to each receptor and initiates bitter taste perception is known for only a few of these 
receptors. In humans, in vitro cell based assays have shown that T2R16 responds to salicin and other 
beta-glucopyranosides and T2R10 displays activity upon exposure to strychnine (Bufe, et al, Nat. 
Genet. 32:397-401, 2002). An alternative human genetic approach has revealed that T2R38 (PTC) 
encodes the receptor for phenylthiocarbamide, a classic variant trait in humans (Kim, et al. Science 
299:1221-1225, 2003). The bitter tastant ligands that activate the remaining 22 human T2R proteins 
are not well characterized. 
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20 SUMMARY OF THE DISCLOSURE 

This disclosure provides a comprehensive collection of single nucleotide polymorphisms 
(SNPs) in bitter taste receptor (T2R) genes (Figure 1). It is believed that a portion of these SNPs 
define biologically relevant difference between different alleles of the bitter taste receptor genes. 
Included in the disclosure are sub-sets of the bitter taste receptor SNPs that represent conserved, non- 
conserved, silent, and truncation mutations in the corresponding proteins, as well as individual allelic 
sequences for the various bitter taste receptor genes. 

The disclosure further provides methods for using the corresponding allelic variants of the 
taste receptor genes, alone or in various combinations, to test a subject's bitter tasting profile, and to 
identify and analyze compounds that interact with and/or influence bitter tastes in subjects. 

Also provided is a substantially comprehensive set of haplotypes for nearly all of the T2R 
bitter taste receptors (T2R1, T2R3, T2R4, T2R5, T2R7, T2R8, T2R9, T2R10, T2R13 T2R14 
T2R16, T2R38, T2R39, T2R40, T2R41, T2R43, T2R44, T2R46, T2R47, T2R48, T2R49, T2R50, and 
T2R60). Details of the haplotypes, and the T2R isoforms encoded thereby, are provided in Table 7. 

The foregoing and other features and advantages will become more apparent from the 
following detailed description of several embodiments, which proceeds with reference to the 
accompanying figures. 
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BRIEF DESCRIPTION OF THE FIGURES 
Figure 1 (including pages 1-5) is a table showing SNPs identified in the indicated T2R 
bitter taste receptor genes. 

Figure 2 is a graph showing the distribution of cSNPs among the five population samples. 
The cSNPs were categorized as to whether they were variable in one, two, three, four, or all five 
populations. Population codes are CAM, Cameroonians; AME, Amerindians; JAP, Japanese; HUG, 
Hungarians; PYG; Pygmies. 

Figure 3 is a graph showing the distributions of Tajima's D statistic. Dotted line indicates 
theoretical expectation under the assumption that human population sizes have been constant 
Dashed line indicates theoretical expectation under the assumption that the human population sizes 
increased from 10,000 to 1,000,000, 100,000 years ago. Observed fractions were calculated across 
all genes (EGP and T2R). EGP genes were resequenced in the 90-member NJH polymorphism 
discovery resource as part of the Environmental Genome Project These genes encode proteins 
thought to be important in mediating Ihe interface between the human body and the environment. 
1 5 Observed T2R genes are the genes resequenced for mis study. 

Figure 4 is a minimum spanning tree of T2R49 haplotypes. Each circle represents a 
haplotype, the area of the circle represents the haplotype frequency, and shading indicates the fraction 
at which the haplotype was observed in each continental sample. Each slash represents one 
nucleotide substitution. Amino acid substitutions are denoted with letter-number combinations. 
20 Europe and Africa are dominated by Cluster 1 while Asia and Amerindian are dominated by 
Cluster 2. 

SEQUENCE LISTING 
The DNA and amino acid sequences listed in the accompanying sequence listing are shown 
25 using standard letter abbreviations for nucleotide bases, and three letter code for amino acids, as 
defined in 37 C.F.R. 1.822. Only one strand of each nucleic acid sequence is shown, but the 
complementary strand is understood as included by any reference to the displayed strand. In the 
accompanying sequence listing: 

30 SEQ ID NO: 1 shows the coding nucleic acid sequence of bitter taste receptor gene T2R1, 

and the protein encoded thereby. One SNP is indicated. 

SEQ ID NO: 2 shows the protein sequence of the T2R1 bitter taste receptor. 
SEQ JD NO: 3 shows the coding nucleic acid sequence of bitter taste receptor gene T2R3, 
and the protein encoded thereby. Three SNPs are indicated. 
35 SEQ ID NO: 4 shows the protein sequence of the T2R3 bitter taste receptor. 

SEQ ID NO: 5 shows the coding nucleic acid sequence of bitter taste receptor gene T2R4, 
and the protein encoded thereby. Six SNPs are indicated. 

SEQ ID NO: 6 shows the protein sequence of the T2R4 bitter taste receptor. 
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SEQ ID NO: 7 shows the coding nucleic acid sequence of bitter taste receptor gene T2R5, 
and Ihe protein encoded thereby. Six SNPs are indicated. 

SEQ ID NO: 8 shows the protein sequence of the T2R5 bitter taste receptor. 

SEQ ID NO: 9 shows the coding nucleic acid sequence of bitter taste receptor gene T2R7, 
5 and the protein encoded thereby. One SNP is indicated. 

SEQ ID NO: 10 shows the protein sequence of the T2R7 bitter taste receptor. 

SEQ ID NO: 1 1 shows the coding nucleic acid sequence of bitter taste receptor gene T2R8, 
and the protein encoded thereby. Four SNPs are indicated. 

SEQ ID NO: 12 shows the protein sequence of the T2R8 bitter taste receptor. 
10 SE Q 10 NO: 13 shows the coding nucleic acid sequence of bitter taste receptor gene T2R9, 

and the protein encoded thereby. Five SNPs are indicated. 

SEQ ID NO: 14 shows the protein sequence of the T2R9 bitter taste receptor. 

SEQ ID NO: 15 shows the coding nucleic acid sequence of bitter taste receptor gene T2R10, 
and the protein encoded thereby. Five SNPs are indicated. 
15 SEQ ID NO: 16 shows the protein sequence of the T2R10 bitter taste receptor. 

SEQ ID NO: 17 shows the coding nucleic acid sequence of bitter taste receptor gene T2R13, 
and the protein encoded thereby. One SNP is indicated. 

SEQ ID NO: 18 shows the protein sequence of the T2R13 bitter taste receptor. 

SEQ ID NO: 19 shows the coding nucleic acid sequence of bitter taste receptor gene T2R14, 
20 and the protein encoded thereby. Two SNPs are indicated. 

SEQ ID NO: 20 shows the protein sequence of the T2R14 bitter taste receptor. 

SEQ ID NO: 21 shows the coding nucleic acid sequence of bitter taste receptor gene T2R16, 
and the protein encoded thereby. Seven SNPs are indicated. 

SEQ ID NO: 22 shows the protein sequence of the T2R16 bitter taste receptor. 
25 SEQ ID NO: 23 shows the coding nucleic acid sequence of bitter taste receptor gene T2R38, 

and the protein encoded thereby. Five SNPs are indicated. 

SEQ ID NO: 24 shows the protein sequence of the T2R38 bitter taste receptor, also known 
as the PTC taste receptor. 

SEQ ID NO: 25 shows the coding nucleic acid sequence of bitter taste receptor gene T2R39, 
30 and the protein encoded thereby. Two SNPs are indicated. 

SEQ ID NO: 26 shows the protein sequence of the T2R39 bitter taste receptor. 
SEQ ID NO: 27 shows the coding nucleic acid sequence of bitter taste receptor gene T2R40, 
and the protein encoded thereby. Two SNPs are indicated. 

SEQ ID NO: 28 shows the protein sequence of the T2R40 bitter taste receptor. 
35 SEQ ID NO: 29 shows the coding nucleic acid sequence of bitter taste receptor gene T2R41, 

and the protein encoded thereby. Three SNPs are indicated. 

SEQ ID NO: 30 shows the protein sequence of the T2R41 bitter taste receptor. 
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SEQ ID NO: 3 1 shows the coding nucleic acid sequence of bitter taste receptor gene T2R43 
(GenBank Accession No. AF494237), and the protein encoded thereby. Ten SNPs are indicated. 

SEQ ID NO: 32 shows the protein sequence of the T2R43 bitter taste receptor. 

SEQ ID NO: 33 shows the coding nucleic acid sequence of bitter taste receptor gene T2R44, 
5 and the protein encoded thereby. Ten SNPs are indicated. 

SEQ ID NO: 34 shows the protein sequence of the T2R44 bitter taste receptor. 

SEQ ID NO: 35 shows the coding nucleic acid sequence of bitter taste receptor gene T2R46, 
and the protein encoded thereby. Four SNPs are indicated. 

SEQ ID NO: 36 shows the protein sequence of the T2R46 bitter taste receptor. 
1 0 SEQ ID NO: 37 shows the coding nucleic acid sequence of bitter taste receptor gene T2R47, 

and the protein encoded thereby. 

SEQ ID NO: 38 shows the protein sequence of the T2R47 bitter taste receptor. 

SEQ ID NO: 39 shows the coding nucleic acid sequence of bitter taste receptor gene T2R48, 
and the protein encoded thereby. Ten SNPs are indicated. 
1 5 SEQ ID NO: 40 shows the protein sequence of the T2R48 bitter taste receptor. 

SEQ ID NO: 41 shows the coding nucleic acid sequence of bitter taste receptor gene T2R49, 
and the protein encoded thereby. Ten SNPs are indicated. 

SEQ ID NO: 42 shows the protein sequence of the T2R49 bitter taste receptor. 

SEQ ID NO: 43 shows the coding nucleic acid sequence of bitter taste receptor gene T2R50, 
20 and the protein encoded thereby. 

SEQ ID NO: 44 shows the protein sequence of the T2R50 bitter taste receptor. 

SEQ ID NO: 45 shows the coding nucleic acid sequence of bitter taste receptor gene T2R60, 
and the protein encoded thereby. Two SNPs are indicated 

SEQ ID NO: 46 shows the protein sequence of the T2R60 bitter taste receptor. 
25 SEQ ID NOs: 47 (GenBank Accession No. AF227129), 49, and 51 (GenBank Accession 

No. AC026787.5) show the coding nucleic acid sequence of haplotypes of the T2R1 bitter taste 
receptor gene, and the proteins encoded thereby. 

SEQ ID NOs: 48, 50, and 52 show the protein sequences of the haplotypes of the T2R1 
bitter taste receptor. 

30 SEQ ID NOs: 53 (GenBank Accession No. AF227130) and 55 show the coding nucleic acid 

sequence of haplotypes of the T2R3 bitter taste receptor gene, and the proteins encoded thereby. 

SEQ ID NOs: 54 and 56 show the protein sequences of the haplotypes of the T2R3 bitter 
taste receptor. 

SEQ ID NOs: 57, 59, 61 (GenBank Accession No. AF227131), 63, 65, 67, 69, and 71 show 
35 the coding nucleic acid sequence of haplotypes of the T2R4 bitter taste receptor gene, and the 
proteins encoded thereby. 

SEQ ID NOs: 58, 60, 62, 64, 66, 68, 70, and 72 show the protein sequences of the 
haplotypes of the T2R4 bitter taste receptor. 
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SEQ ID NOs: 73 (GenBank Accession No. AF227132), 75, 77, 79, 81, 83, and 85 show the 
coding nucleic acid sequence of haplotypes of the T2R5 bitter taste receptor gene, and the proteins 
encoded thereby. 

SEQ ID NOs: 74, 76, 78, 80, 82, 84, and 86 show the protein sequences of the haplotypes of 
5 the T2R5 bitter taste receptor. 

SEQ ID NOs: 87 (GenBank Accession No. AF227133), 89, 91, 93, and 95 show the coding 
nucleic acid sequence of haplotypes of the T2R7 bitter taste receptor gene, and the proteins encoded 
thereby. 

SEQ ID NOs: 88, 90, 92, 94, and 96 show the protein sequences of the haplotypes of the 
10 . T2R7 bitter taste receptor. 

SEQ ID NOs: 97 (GenBank Accession No. AF227134), 99, 101, 103, 105, and 107 show the 
coding nucleic acid sequence of haplotypes of the T2R8 bitter taste receptor gene, and the proteins 
encoded thereby. 

SEQ ID NOs: 98, 100, 102, 104, 106, and 108 show the protein sequences of the haplotypes 
15 of the T2R8 bitter taste receptor. 

SEQ ID NOs: 109 (GenBank Accession No. AF227135), 111, 113, 115, 117, 119, 121, and 
123 show the coding nucleic acid sequence of haplotypes of the T2R9 bitter taste receptor gene, and 
the proteins encoded thereby. 

SEQ ID NOs: 110, 112, 114, 116, 118, 120, 122, and 124 show the protein sequences of Ihe 
20 haplotypes of the T2R9 bitter taste receptor. 

SEQ ID NOs: 125, 127, 129, and 131 (GenBank Accession No. AF227136) show the coding 
nucleic acid sequence of haplotypes of the T2R10 bitter taste receptor gene, and Ihe proteins encoded 
thereby. 

SEQ ID NOs: 126, 128, 130, and 132 show the protein sequences of the haplotypes of the 
25 T2R10 bitter taste receptor. 

SEQ ID NOs: 133 (GenBank Accession No. AF227137) and 135 show the coding nucleic 
acid sequence of haplotypes of the T2R13 bitter taste receptor gene, and the proteins encoded 
Ihereby. 

SEQ ID NOs: 134 and 136 show the protein sequences of the haplotypes of the T2R13 bitter 
30 taste receptor. 

SEQ ID NOs: 137 (GenBank AccessionNo. AF227138), 139, and 141 show the coding 
nucleic acid sequence of haplotypes of the T2R14 bitter taste receptor gene, and the proteins encoded 
thereby. 

SEQ ID NOs: 138, 140, and 142 show the protein sequences of the haplotypes of the T2R14 
35 bitter taste receptor. 

SEQ ID NOs: 143 (GenBank AccessionNo. CQ740130.1), 145 (GenBank Accession No. 
AF227139), 147, 149, and 151 show the coding nucleic acid sequence of haplotypes of the T2R16 
bitter taste receptor gene, and the proteins encoded thereby. 
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SEQ ID NOs: 144, 146, 148, 150, and 152 show the protein sequences of the haplotypes of 
the T2R16 bitter taste receptor. 

SEQIDNOs: 153 (GenBank Accession No. AY25 85 97.1), 155, 157, 159 (GenBank 
Accession Nos. AX647247.1 and AY1 14095.1), 161, 163, and 165 (GenBank Accession No. 
AF494231) show the coding nucleic acid sequence of haplotypes of the T2R38 bitter taste receptor 
gene, and the proteins encoded thereby. 

SEQ ID NOs: 154, 156, 158, 160, 162, 164, and 166 show the protein sequences of the 
haplotypes of the T2R38 bitter taste receptor. 

SEQ ID NOs: 167 (GenBank Accession No. AF494230) and 169 show the coding nucleic 
acid sequence of haplotypes of the T2R39 bitter taste receptor gene, and the proteins encoded 
thereby. 

SEQ ID NOs: 168 and 170 show the protein sequences of the haplotypes of the T2R39 bitter 
taste receptor. 

SEQ ID NOs: 171 (GenBank Accession No. AF494229), 173, 175, and 179 show the coding 
15 nucleic acid sequence of haplotypes of the T2R40 bitter taste receptor gene, and the proteins encoded 
thereby. 

SEQ ID NOs: 172, 174, 176, and-180 show the protein sequences of the haplotypes of the 
T2R40 bitter taste receptor. 

SEQIDNOs: 181, 183, and 185 (GenBank Accession No. AF494232) show the coding 
20 nucleic acid sequence of haplotypes of the T2R41 bitter taste receptor gene, and the proteins encoded 
thereby. 

SEQ ID NOs: 182, 184, and 186 show the protein sequences of the haplotypes of the T2R41 
bitter taste receptor. 

SEQ ID NOs: 187, 189, 191, 193 (GenBank Accession No. AF494228), 195 (GenBank 
25 Accession No. AX647301.1 and AC018630.40), 197, and 199 show the coding nucleic acid sequence 
of haplotypes of the T2R44 bitter taste receptor gene^ and the proteins encoded thereby. 

SEQ ID NOs: 188, 190, 192, 194, 196, 198, and 200 show the protein sequences of the 
haplotypes of the T2R44 bitter taste receptor. 

SEQ ID NOs: 201, 203, 205, 207 (GenBank Accession No. AF494227), 209, and 21 1 show 
30 the coding nucleic acid sequence of haplotypes of the T2R46 bitter taste receptor gene, and the 
proteins encoded thereby. 

SEQ ID NOs: 202, 204, 206, 208, 210, and 212 show the protein sequences of the 
haplotypes of the T2R46 bitter taste receptor. 

SEQ ID NOs: 213, 215 (GenBank Accession No. AF494233), 217, and 219 show the coding 
35 nucleic acid sequence of haplotypes of the T2R47 bitter taste receptor gene, and the proteins encoded 
thereby. 

SEQ ID NOs: 214, 216, 218, and 220 show the protein sequences of the haplotypes of the 
T2R47 bitter taste receptor. 
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SEQ ID NOs: 221 (GenBank Accession no. CQ800016.1), 223 (GenBank Accession No. 
AF494234), 225, 227, 229, 231, 233, 235, and 237 show the coding nucleic acid sequence of 
haplotypes of the T2R48 bitter taste receptor gene, and the proteins encoded thereby. 

SEQ ID NOs: 222, 224, 226, 228, 230, 232, 234, 236, and 238 show the protein sequences 
5 of the haplotypes of the T2R48 bitter taste receptor. 

SEQ ID NOs: 239 (GenBank Accession No. AF494236), 241, 243, 245, 247, 249 and 251 
show the coding nucleic acid sequence of haplotypes of the T2R49 bitter taste receptor gene, and the 
proteins encoded thereby. 

SEQ ID NOs: 240, 242, 244, 246, 248, 250, and 252 show the protein sequences of the 
10 haplotypes of the T2R49 bitter taste receptor. 

SEQ ID NOs: 253, 255 (GenBank Accession No. AF494235), 257, and 259 show the coding 
nucleic acid sequence of haplotypes of the T2R50 bitter taste receptor gene, and the proteins encoded 
thereby. 

SEQ ID NOs: 254, 256, 258 and 260 show the protein sequences of the haplotypes of the 
15 T2R50 bitter taste receptor. 

SEQ ID NOs: 261 (GenBank Accession No. AY1 14094) and 263 show the coding nucleic 
acid sequence of haplotypes of the T2R60 bitter taste receptor gene, and the proteins encoded 
Ihereby. 

SEQ ID NOs: 262 and 264 show the protein sequences of the haplotypes of the T2R60 bitter 
20 taste receptor. 
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/. Abbreviations 



DETAILED DESCRIPTION 



9* 2 aoa AGE two-dimensional polyacrylamide gel electrophoresis 

2D allele-specific oligonucleotide 

ASOH allele-specific oligonucleotide hybridization 

DASH dynamic allele-specific hybridization 

ELISA enzyme-linked immunosorbant assay 

HPLC high pressure liquid chromatography 
MALDI-TOF matrix-assisted laser desorption/ionization time-of-flight 

PCR polymerase chain reaction 

RT-PCR reverse-transcription polymerase chain reaction 

SNP single nucleotide polymorphism 

SSCP single-strand conformation polymorphism 



30 MALDI-TOF 



//. Terms 

Unless otherwise noted, technical terms are used according to conventional usage. 
Definitions of common terms in molecular biology may be found in Benjamin Lewin, Genes V 7 
published by Oxford University Press, 1994 (ISBN 0-19-854287-9); Kendrew et al (Ids.), Tlie 
Encyclopedia of Molecular Biology, published by Blackwell Science Ltd., 1994 (ISBN 0-632-02182- 
9); and Robert A. Meyers (ed.), Molecular Biology and Biotechnology: a Comprehensive Desk 
Reference, published by VCH Publishers, Inc., 1995 (ISBN 1-56081-569-8). 
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la order to facilitate review of the various embodiments of the invention, the following 
explanations of specific terms are provided: 

Addressable: Capable of being reliably and consistently located and identified, as in an 
addressable location on an array. 

Allele: Aparticular form of a genetic locus, distinguished from other forms by its specific 
nucleotide sequence. 

Amplified RNA (amRNA): A molecule of RNA generated through in vitro transcription 
with T7 or other promoter region attached to the 5' end of the template. 

Antisense, Sense, and Antigene: Double-stranded DNA (dsDNA) has two strands, a 5' -> 
3 strand, referred to as the plus strand, and a 3' -> 5' strand (the reverse complement), referred to as 
the nanus strand. Because RNA polymerase adds nucleic acids in a 5' -> 3 ■ direction, the minus 
strand of the DNA serves as the template for the RNA during transcription. Thus, the RNA formed 
will have a sequence complementary to the minus strand and identical to the plus strand (except mat 
15 U is substituted for T). 

Antisense molecules are molecules that are specifically hybridizable or specifically 
complementary to either RNA or the plus strand of DNA. Sense molecules are molecules that are 
specifically hybridizable or specifically complementary to the minus strand of DNA. Antigene 
molecules are either antisense or sense molecules directed to a dsDNA target 

Array: An arrangement ofmolecules, particularly biological macromolecules (such as 
polypeptides or nucleic acids) or biological samples (such as tissue sections) in addressable locations 
on a substrate, usually a flat substrate such as a membrane, plate or slide. The array may be regular 
arranged inuniformrows and columns, for instance) or irregular. The number of addressable 
locations on the array can vary.for example from a few (such as three)to more than50, 100 200 
500,1000, 10,000, or more. A "nucroarray" is an array fi,at is rniniaturized to such an extenl ^ 
benefits from microscopic examination for evaluation. 

«fe ta " T' ^ ^ m0l6CUle ieS ' °^ 0nUcleotide ) « -mple (-re generally, a 

future .of the array) is addressable, in that its location can be rehably and consistently determined 
within the at least two dimensions on the array surface. Thus, in ordered arrays the location of each 
feature is usually assigned to a sample at the time when it is spotted onto or otherwise applied to the 
array surface, and a key may be provided in order to correlate each location with the appropriate 
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Often, ordered arrays are arranged in a symmetrical grid pattern, but samples could be 
arranged in other patterns (e. g ., in radially distributed lines, spiral lines, or ordered clusters) Arrays 
are computer readable, in that a computer can be programmed to correlate a particular address on the 
array with information (such as identification of the arrayed sample and hybridization or binding 
data, mcluding for instance signal intensity). In some examples of computer readable array formats 
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the individual spots on the array surface will be arranged regularly, for instance in a Cartesian grid 
pattern, that can be correlated to address information by a computer. 

The sample application spot (or feature) on an array may assume many different shapes 
Thus, though the term "spot" is used herein, it refers generally to a localized deposit of nucleic acid 
or other biomolecule, and is not limited to a round or substantially round region. For instance 
substantially square regions of application can be used with arrays, as can be regions that are ' 
substantially rectangular (such as a slot blot-type application), or triangular, oval, irregular, and so 
forth. The shape of the array substrate itself is also immaterial, though it is usually substantially flat 
and may be rectangular or square in general shape. 

Binding or interaction: An association between two substances or molecules, such as the 
hybruhzation of one nucleic acid molecule to another (or itself). Disclosed arrays are used to detect 
binding of, in some embodiments, a labeled nucleic acid molecule (target) to an immobilized nucleic 
acxd molecule (probe) in one or more features of the array. A labeled target molecule "binds" to a 
nucleic acid molecule in a spot on an array if, after incubation of the (labeled) target molecule 
(usually in solution or suspension) with or on the array for a period of time (usually 5 minutes or 
more, for instance 10 minutes, 20 minutes, 30 minutes, 60 minutes, 90 minutes, 120 minutes or more 
for instance over night or even 24 hours), a detectable amount of that molecule associates with a 
nucleic acid feature of the array to such an extent that it is not removed by being washed with a 
relatively low stringency buffer ( e .g. , higher salt (such as 3 X SSC or higher), room temperature 
washes). Washing can be carried out, for instance, at room temperature, but other temperatures 
(exther higher or lower) also can be used. Targets will bind probe nucleic acid molecules within 
different features on the array to different extents, based at least on sequence homology, and the term 
W encompasses both relatively weak and relatively strong interactions. Thus, some binding will 
persist after the array is washed in a more stringent buffer (e.g., lower salt (such as about 0.5 to about 
25 1 .5 x SSC), 55-65* C washes). 

Where the probe and target molecules are both nucleic acids, binding of the test or reference 
molecule to a feature on the array can be discussed in terms of the specific complementarity between 
the probe and the target nucleic acids. Also contemplated herein are protein-based arrays, where the 
probe molecules are or comprise proteins, and/or where the target molecules are or comprise proteins 

cDNA: A DNA molecule lacking internal, non-coding segments (e.g., introns) and 
regulatory sequences that determine transcription. By way of example, cDNA may be synthesized in 
the laboratory by reverse transcription from messenger RNA extracted from cells. 

DNA (deoxyribonucleic acid): DNA is a long chain polymer that contains the genetic 
material of most living organisms (the genes of some viruses are made of ribonucleic acid (RNA)) 
The repeating units in DNA polymers are four different nucleotides, each of which includes one of 
the four bases (adenine, guanine, cytosine and myrnine) bound to a deoxyribose sugar to which a 
phosphate group is attached. Triplets of nucleotides (referred to as codons) code for each amino acid 
in a polypeptide, or for a stop signal. The term "codon" is also used for the corresponding (and 
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cornplementary) sequences of three nucleotides in the mRNA into which the DNA sequence is 
transcribed. 

Enriched: The term "enriched" means that the concentration of a material is at least about 
2, 5, 10, 100, or 1000 times its natural concentration (for example), advantageously at least 0.01% by 
5 weight. Enriched preparations of about 0.5%, 1%, 5%, 10%, and 20% by weight are also 
contemplated. • 

EST (Expressed Sequence Tag): A partial DNA or cDNA sequence, typically of between 
200 and 2000 sequential nucleotides, obtained from a genomic or cDNA library, prepared from a 
selected ceU, cell type, tissue or tissue type, organ or organism, which corresponds to an mRNA of a 

10 gene found in that library. An EST is generally a DNA molecule sequenced from and shorter than 
the cDNA from which it is obtained. 

Fluorophore: A chemical compound, which when excited by exposure to a particular 
wavelength of light, emits tight (i.e., fluoresces), for example at a different wavelength. Fluorophores 
can be described in terms of their emission profile, or "color." Green fluorophores, for example Cy3, 

15 FITC, and Oregon Green, are characterized by their emission at wavelengths generally in the range of 
515-540 X. Red fluorophores, for example Texas Red, Cy5 and tetramethykhodamine, are 
characterized by their emission at wavelengths generally in the range of 590-690 JL 

Examples of fluorophores are provided in U.S. Patent No. 5,866,366 to Nazarenko et al, 
and include for instance: 4-acetarmdo-4'-isotmocyanatostilbene-2,2'disulfonic acid, acridine and 

20 derivatives such as acridine and acridine isothiocyanate, 5-(2'-anunoemyl)aminonaphthalene-l- 

sulfonic acid (EDANS), 4-ammo-N-[3-vmylsulfonyl)phenyl]mphthalimide-3,5 ^sulfonate (Lucifer 
Yellow VS), N-(4-anilino-l-naphmyl)maleimide, anthranilamide, Brilliant Yellow, coumarin and 
derivatives such as coumarin, 7-arnino-4-methylcoumarin (AMC, Coumarin 120), 7-amino-4- 
trifluoromemylcouluarin(Coumaran 151); cyanosine; 4\6-maminidino-2-phenylindole (DAPI); 5', 

25 5"-dibromopyrogaUol-sulfoneph1halein (Bromopyrogallol Red); 7-dttethylamino-3-(4'- 
isounocyanatophenyl)-4-me1hylcoumarin; diethylenetriamine pentaacetate; 4,4'- 
dusomiocyamtodmydro-smbene-^'-disulfonic acid; 4,4'-diisothiocyanatostilbene-2,2'-disulfonic 
acid; 5-[dimethylamino]naphuialene-l-sulfonyl chloride (DNS, dansyl chloride); 4-(4'- 
dime%laminophenylazo)benzoic acid (DABCYL); 4-dimemykminophenylazophenyl-4'- 

30 isothiocyanate (DABITC); eosin and derivatives such as eosin and eosin isothiocyanate; erythrosin 
and derivatives such as erythrosin B and erythrosin isothiocyanate; ethidium; fluorescein and 
derivatives such as 5-carboxyfluorescein (FAM), 5-(4,6-dicMorotria2m-2-yl)aminofluorescein 
(DTAF), 2'7'-dimemoxy-4'5'-dicliloro-6-carboxyfluorescein (JOE), fluorescein, fluorescein 
isothiocyanate (FITC), and QFITC (XRITC); fluorescamine; IR144; IR1446; Malachite Green 

35 isothiocyanate; 4-methylumbelliferone; ortho cresolphthalein; nitrotyrosine; pararosaniline; Phenol 
Red; B-phycoerythrin; o-phthaldialdehyde; pyrene and derivatives such as pyrene, pyrene butyrate 
and succinimidyl 1-pyrene butyrate; Reactive Red 4 (Cibacron .RTM. Brilliant Red 3B-A); 
rhodamine and derivatives such as 6-carboxy-X-rhodamine (ROX), 6-carboxyrhodamine (R6G), 
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lissamine rhodamine B sulfonyl chloride, rhodamine (Rhod), rhodamine B, rhodamine 123, 
rhodamine X isothiocyanate, sulforhodaraine B, sulforhodamine 101 and sulfonyl chloride derivative 
of sulforhodamine 101 (Texas Red); N,N,NSN'-tetramethyl-6-carboxyrhodamine (TAMRA); 
tetramethyl rhodamine; tetramethyl rhodamine isothiocyanate (TRITC); riboflavin; rosolic acid and 
5 terbium chelate derivatives. 

Other contemplated fluorophores include GFP (green fluorescent protein), Lissamine™, 
diethylarriinocoumarin, fluorescein chlorotriazinyl, naphthofluorescein, 4,7-dicMororhodamine and 
xanthene and derivatives thereof. Other fluorophores known to those skilled in the art may also be 
used. 

10 Examples of fluorophores that are sensitive to ion concentration (such as Ca 2+ concentration 

or flux) include, but are not limited to, bis-(l,3-dibutylbarbituric acid)trimethine oxonol (DiBAC4(3) 
(B-438), Quin-2 (AM Q-1288), Fura-2 (AM F-1225), Indo-1 (AM 1-1226), Fura-3 (AM F-1228), 
Fluo-3 (AM F-1241), Rhod-2, (AM R-1244), BAPTA (AM B-1205), 5,5 '-dimethyl BAPTA (AM D- 
1207), 4,4'-difluoro BAPTA (AM D-1216), 5,5'-difluoro BAPTA (AM D-1209), 5,5'-dibromo 

15 BAPTA (AM D-1213), Calcium Green (C-301 1), Calcium Orange (C-3014), Calcium Crimson (C- 
3017), Fura-5 (F-3023), Fura-Red (F-3020), SBFI (S-1262), PBFI (P-1265), Mag-Fura-2 (AM M- 
1291), Mag-Indo-1 (AM M-1294), Mag-Quin-2 (AM M-1299), Mag-Quin-1 (AM M-1297), SPQ (M- 
440), SPA (S-460), Calcien (Fluorescein-bis(methyliminodiacetic acid); Fluorexon), and Quin-2 (2- 
• { [2-Bis-(carboxymethyl)airnno-5-methylphenoxy]-methyl} -6-methoxy-8-bis- 

20 (carboxymethyl)armnoquinoline tetrapotassium salt). Many of these (and other calcium sensing 
compounds known to those of ordinary skill) are available, for instance, from Molecular Probes, 
Invitrogen Detection Technologies, Eugene, OR. 

Haplotype: The ordered, linear combination of polymorphisms (e.g., SNPs) in the sequence 
of each form of a gene (on individual chromosomes) that exists in the population. 

25 Haplotyping: Any process for determining one or more haplotypes in an individual. 

Example methods are described herein, and may include use of family pedigrees, molecular 
biological techniques, statistical inference, or any combination thereof. 

High throughput genomics: Application of genomic or genetic data or analysis techniques 
that use microarrays or other genomic technologies to rapidly identify large numbers of genes or 

30 proteins, or distinguish their structure, expression or function from normal or abnormal cells or 
tissues, or from cells or tissues of subjects with known or unknown phenotype and/or genotype. 

Human Cells: Cells obtained from a member of the species Homo sapiens. The cells can 
be obtained from any source, for example peripheral blood, urine, saliva, tissue biopsy, surgical 
specimen, amniocentesis samples and autopsy material. From these cells, genomic DNA, mRNA, 

35 cDNA, RNA, and/or protein can be isolated. 

Hybridization: Nucleic acid molecules that are complementary to each other hybridize by 
hydrogen bonding, which includes Watson-Crick, Hoogsteen or reversed Hoogsteen hydrogen 
bonding between complementary nucleotide units. For example, adenine and thymine are 
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complementary nucleobases that pair through formation of hydrogen bonds. "Complementary" refers 
to sequence complementarity between two nucleotide units. For example, if a nucleotide unit at a 
certain position of an oligonucleotide is capable of hydrogen bonding with a nucleotide unit at the 
same position of a DNA or RNA molecule, then the oligonucleotides are complementary to each 
other at that position. The oligonucleotide and the DNA or RNA are complementary to each other 
when a sufficient number of corresponding positions in each molecule are occupied by nucleotide 
units which can hydrogen bond with each other. 

"Specifically hybridizable" and "complementary" are terms mat indicate a sufficient degree 
of complementarity such mat stable and specific binding occurs between the oligonucleotide and the 
DNA or RNA or PNA target. An oligonucleotide need not be 100% complementary to its target 
nucleic acid sequence to be specifically hybridizable. An oligonucleotide is specifically hybridizable 
when binding of the oligonucleotide to the target DNA or RNA molecule interferes with the normal 
function of the target DNA or RNA and there is a sufficient degree of complementarity to avoid non- 
specific binding of the oligonucleotide to non-target sequences under conditions in which specific 
15 binding is desired, for example under physiological conditions in me case of in vivo assays, or under 
conditions in which the assays are performed. 

Hybridization conditions resulting in particular degrees of stringency will vary depending 
upon the nature of the hybridization method of choice and the composition and length of the 
hybridizing DNA used. Generally, the temperature of hybridization and the ionic strength (especially 
the Na + concentration) of the hybridization buffer will determine the stringency of hybridization. 
Calculations regarding hybridization conditions required for attaining particular degrees of stringency 
are discussed by Sambrook et al. mMolecular Cloning: A Laboratory Manual, Cold Spring Harbor 
Laboratory Press (1989), chapters 9 and 1 1, herein incorporated by reference. 

In vitro amplification: Techniques that increase the number of copies of a nucleic acid 
molecule in a sample or specimen. An example of in vitro amplification is the polymerase chain 
reaction, in which a biological sample collected from a subject is contacted with a pair of 
oligonucleotide primers, under conditions that allow for the hybridization of the primers to nucleic 
acid template in the sample. The primers are extended under suitable conditions, dissociated from the 
template, and then re-annealed, extended, and dissociated to amplify the number of copies of the 
30 nucleic acid. 

The product of in vitro amplification may be characterized by electrophoresis, restriction 
endonuclease cleavage patterns, oligonucleotide hybridization or ligation, and/or nucleic acid 
sequencing, using standard techniques. 

Other examples of in vitro amplification techniques include strand displacement 
35 amplification (see U.S. Patent No. 5,744,31 1); transcription-free isothermal amplification (see U.S. 
Patent No. 6,033,881); repair chain reaction amplification (see WO 90/01069); ligase chain reaction 
amplification (see EP-A-320 308); gap filling ligase chain reaction amplification (see U.S. Patent No 
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5,427,930); coupled ligase detection and PGR (see U.S. Patent No. 6,027,889); and NASBA™ mA 
transcnption-free amplification (see U.S. Patent No. 6,025,134). 

Isoform: As used herein, the term isoform refers to a protein with a unique amino acid 
sequence specified by one haplotype of a gene, such as a T2R bitter receptor gene By way of 
example, specific examples of T2R isoforms are shown in the sequence listing, SEQ ID NOs- 48-264 
(even). 

isolated: An "isolated" biological component (such as a nucleic acid molecule, protein or 
organelle) has been substantially separated or purified away from other biological components in the 
cell of the organism in which the component naturally occurs, i.e., other chromosomal and extra- 
chromosomal DNA and RNA, proteins and organelle, Nucleic acids and proteins that have been 
'isolated" include nucleic acids and proteins purified by standard purification methods. The term 
also embraces nucleic acids andproteins prepared by recombinant expression in a host cell as well as 
chemically synthesized nucleic acids. 

Label: Detectable marker or reporter molecules, which canbe attached to nucleic acids 
Typical labels include fluorophores, radioactive isotopes, ligands, chenmuminescent agents, metal 
sols and colloids, and enzymes. Methods for labeling and guidance in the choice of labels useful for 
various purposes are discussed, eg., in Sambrook et al, in Molecular Cloning: A Laboratory 
Manual, Cold Spring Harbor Laboratory Press (1989) and Ausubel etal, in Current Protocols in 
Molecular Biology, Greene Publishing Associates and Wiley-Intersciences (1987) 

Mutation: Any change of the DNA sequence within a gene or chromosome. In some 
instances, a mutation will alter a characteristic or trait (phenotype), but this is not always the case 
Types of mutations include base substitution point mutations (e.g., transitions or Aversions) ' 
deletions, and insertions. Missense mutations are those that introduce a different amino acid into the 
sequence of the encoded protein; nonsense mutations are those that introduce a new stop codon In 
the case of insertions or deletions, mutations can be in-frame (not changing the frame of the overall 
sequence) or frame shift mutations, which may result in the misreading of a large number of codons 
(and often leads to abnormal termination of the encoded product due to the presence of a stop codon 
in the alternative frame). 

This term specifically encompasses variations that arise through somatic mutation, for 
mstance those that are found only in disease cells, but not constitutionally, in a given individual 
Examples of such somatically-acquired variations include the point mutations mat frequently result in 
altered function of various genes that are involved in development of cancers. This term also 
encompasses DNA alterations that are present constitutionally, that alter the function of the encoded 
protem m a readily demonstrable manner, and that can be inherited by the children of an affected 
indmdual. In this respect, the term overlaps with "polymorphism," as defined below, but generally 
refers to the subset of constitutional alterations. 
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Nucleic acid: A deoxyribonucleotide or ribonucleotide polymer in either single or double 
stranded form, and unless otherwise limited, encompassing known analogues of natural nucleotides 
that hybridize to nucleic acids in a manner similar to naturally occurring nucleotides. 

Nucleic acid array: An arrangement of nucleic acids (such as DNA or RNA) in assigned 
locations on a matrix, such as that found in cDNA arrays, or oligonucleotide arrays. 

Nucleic acid molecules representing genes: Any nucleic acid, for example DNA (intron or 
exon or both), cDNA or RNA, of any length suitable for use as a probe or other indicator molecule, 
and that is informative about the corresponding gene. 

Nucleotide: "Nucleotide" includes, but is not limited to, a monomer that includes a base 
linked to a sugar, such as a pyrimidine, purine or synthetic analogs thereof, or a base linked to an 
amino acid, as in a peptide nucleic acid (PNA). A nucleotide is one monomer in a polynucleotide. A 
nucleotide sequence refers to the sequence of bases in a polynucleotide. 

Oligonucleotide: A linear single-stranded polynucleotide sequence ranging in length from 
2 to about 5,000 bases, for example a polynucleotide (such as DNA or RNA) which is at least 6 
15 nucleotides, for example at least 10, 12, 15, 18, 20, 25, 50, 100, 200, 1,000, or even 5,000 nucleotides 
long. Oligonucleotides are often synthetic but can also be produced from naturally occurring 
polynucleotides. 

An oligonucleotide analog refers to moieties that function similarly to oligonucleotides but 
have non-naturally occurring portions. For example, oligonucleotide analogs can contain non- 
20 naturally occurring portions, such as altered sugar moieties or inter-sugar linkages, such as a 

phosphorothioate oligodeoxynucleotide. Functional analogs of naturally occurring polynucleotides 
can bind to RNA or DNA and include peptide nucleic acid (PNA) molecules. Such analog 
molecules may also bind to or interact with polypeptides or proteins. 

Operably linked: A first nucleic acid sequence is operably linked with a second nucleic 
acid sequence when the first nucleic acid sequence is placed in a functional relationship with the 
second nucleic acid sequence. For instance, a promoter is operably linked to a coding sequence if the 
promoter affects the transcription or expression of the coding sequence. Generally, operably linked 
DNA sequences are contiguous and, where necessary to join two protein-coding regions, in the same 
reading frame. 

Open reading frame (ORF): A series of nucleotide triplets (codons) coding for amino 
acids without any internal termination codons. These sequences are usually translatable into a 
peptide. 

Peptide Nucleic Acid (PNA): An oligonucleotide analog with a backbone comprised of 
monomers coupled by amide (peptide) bonds, such as amino acid monomers joined by peptide bonds. 

Pharmaceutkally acceptable carriers: The pharmaceutically acceptable carriers useful 
with compositions provided herein are conventional. By way of example, Martin, in Remington 's 
Pharmaceutical Sciences, published by Mack Publishing Co., Easton, PA, 19th Edition, 1995, 
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describes compositions and formulations suitable for pharmaceutical delivery of the nucleotides and 
proteins herein disclosed. 

In general, the nature of the carrier will depend on the particular mode of administration 
being employed. For instance, parenteral formulations usually comprise injectable fluids that include 
5 pharmaceutically and physiologically acceptable fluids such as water, physiological saline, balanced 
salt solutions, aqueous dextrose, glycerol or the like as a vehicle. For solid compositions (e.g., 
powder, pill, tablet, or capsule forms), conventional non-toxic solid carriers can include, for example, 
pharmaceutical grades of mannitol, lactose, starch, or magnesium stearate. In addition to 
biologically-neutral carriers, pharmaceutical compositions to be administered can contain minor 

10 amounts of non-toxic auxiliary substances, such as wetting or emmsifying agents, preservatives, and 
pH buffering agents and the like, for example sodium acetate or sorbitan monolaurate. 

Polymorphism: Variant in a sequence of a gene, usually carried from one generation to 
another in a population. Polymorphisms can be those variations (nucleotide sequence differences) 
that, while having a different nucleotide sequence, produce functionally equivalent gene products, 

15 such as those variations generally found between individuals, different ethnic groups, geographic 
locations. The term polymorphism also encompasses variations that produce gene products with 
altered function, i.e., variants in the gene sequence that lead to gene products that are not functionally 
equivalent. This term also encompasses variations that produce no gene product, an inactive gene 
product, or increased or increased activity gene product 

20 Polymorphisms can be referred to, for instance, by the nucleotide position at which the 

variation exists, by the change in amino acid sequence caused by the nucleotide variation, or by a 
change in some other characteristic of the nucleic acid molecule or protein that is linked to the 
variation (e.g., an alteration of a secondary structure such as a stem-loop, or an alteration of the 
binding affinity of the nucleic acid for associated molecules, such as polymerases, RNases, and so 

25 forth). 

Probes and primers: Nucleic acid probes and primers can be readily prepared based on the 
nucleic acid molecules provided as indicators of taste reception or likely taste reception. It is also 
appropriate to generate probes and primers based on fragments or portions of these nucleic acid 
molecules, particularly in order to distinguish between and among different aUeles and haplotypes 
within a single gene. Also appropriate are probes and primers specific for the reverse complement of 
these sequences, as well as probes and primers to 5' or 3' regions. 

A probe comprises an isolated nucleic acid attached to a detectable label or other reporter 
molecule. Typical labels include radioactive isotopes, enzyme substrates, co-factors, ligands, 
chemiluminescent or fluorescent agents, haptens, and enzymes. Methods for labeling and guidance in 
35 the choice of labels appropriate for various purposes are discussed, e.g., in Sambrook et al (In 

Molecular Cloning: A Laboratory Manual, CSHL, New York, 1989) and Ausubel et al (In Current 
Protocols in Molecular Biology, John Wiley & Sons, New York, 1998). 
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85:2444, 1988; Higgins & Sharp, Gene, 73:237-44, 1988; Higgins & Sharp, CABIOS 5:151-3, 1989; 
Corpet et al, Nuc. Acids Res. 16:10881-90, 1988; Huang et al Computer Appls. Biosci. 8, 155-65, 
1992; and Pearson etal.,Meth. Mol. Bio. 24:307-31, 1994. Altschul et al, J. Mot Biol. 215:403-10, 
1990, presents a detailed consideration of sequence alignment methods and homology calculations. ' 

The NCBI Basic Local Alignment Search Tool (BLAST) (Altschul et al, J. Mol. Biol. 
215:403-10, 1990) is available from several sources, including the National Center for Biotechnology 
Information (NCBI, Belhesda, MD) and on the Internet, for use in connection with the sequence 
analysis programs blastp, blastn, blastx, tblasta and tblastx. Each of these sources also provides a 
description of how to determine sequence identity using this program. 

Homologous sequences are typically characterized by possession of at least 60%, 70%, 75%, 
80%, 90%, 95% or at least 98% sequence identity counted over the full length alignment with a' 
sequence using the NCBI Blast 2.0, gapped blastp set to default parameters. Queries searched with 
the blastn program are filtered with DUST (Hancock and Armstrong, Comput. Appl Biosci. 10:67-70, 
1994). It will be appreciated that these sequence identity ranges are provided for guidance only; it is ' 
entirely possible that strongly significant homologs could be obtained that fall outside of the ranges 
provided. 

Nucleic acid sequences that do not show a high degree of identity may nevertheless encode 
similar amino acid sequences, due to the degeneracy of the genetic code. It is understood that 
changes in nucleic acid sequence can be made using this degeneracy to produce multiple nucleic acid 
20 sequences that all encode substantially the same protein. 

An alternative indication that two nucleic acid molecules are closely related is that the two 
molecules hybridize to each other under stringent conditions, as described under "specific 
hybridization." 

Single Nucleotide Polymorphism (SNP): A single base (nucleotide) difference in a 
specific location in the DNA sequence among individuals in a population. A subset of SNPs give rise 
to changes in the encoded amino acid sequence; these are referred to as coding SNPs, or cSNPs. 

Specific binding agent: An agent that binds substantially only to a defined target. Thus a 
protein-specific bmding agent binds substantially only the specified protein. By way of example, as 
used herein, the term "X-protein specific binding agent" includes anti-X protein antibodies (and ' 
functional fragments thereof) and other agents (such as soluble receptors) that bind substantially only 
to the X protein (where «X" is a specified protein, or in some embodiments a specified domain or 
form of a protein, such as a particular allelic form of a protein). 

Anti-X protein antibodies may be produced using standard procedures described in a number 
of texts, including Harlow and Lane {Antibodies, A Laboratory Manual, CSHL, New York, 1988). 
The determination that a particular agent binds substantially only to the specified protein may readily 
be made by using or adapting routine procedures. One suitable in vitro assay makes use of the 
Western blotting procedure (described in many standard texts, including Harlow and Lane 
{Antibodies, A Laboratory Manual, CSHL, New York, 1988)). Western blotting may be used to 
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determine that a given protein binding agent, such as an anti-X protein monoclonal antibody, binds 
substantially only to the X protein. 

Shorter fragments of antibodies can also serve as specific binding agents. For instance, 
Fabs, Fvs, and single-chain Fvs (SCFvs) that bind to a specified protein would be specific binding 
agents. These antibody fragments are defined as follows: (1) Fab, the fragment which contains a 
monovalent antigen-binding fragment of an antibody molecule produced by digestion of whole 
antibody with the enzyme papain to yield an intact light chain and a portion of one heavy chain; (2) 
Fab', the fragment of an antibody molecule obtained by treating whole antibody with pepsin, 
followed by reduction, to yield an intact light chain and a portion of the heavy chain; two Fab' 
fragments are obtained per antibody molecule; (3) (Fab')* the fragment of the antibody obtained by 
treating whole antibody with the enzyme pepsin without subsequent reduction; (4) Ffrb')* a dimer of 
two Fab' fragments held together by two disulfide bonds; (5) Fv, a genetically engineered fragment 
containing the variable region of the light chain and the variable region of the heavy chain expressed 
as two chains; and (6) single chain antibody ("SCA"), a genetically engineered molecule containing 
the variable region of the light chain, the variable region ofthe heavy chain, linked by a suitable 
polypeptide linker as a genetically fused single chain molecule. Methods of making these fragments 
are routine. 

Specific hybridization: Specific hybridization refers to the binding, duplexing, or 
hybridizing of a molecule only or substantially only to a particular nucleotide sequence when that 
sequence is present in a complex mixture {e.g. total cellular DNA or RNA). Specific hybridization 
may also occur under conditions of varying stringency. 

Hybridization conditions resulting in particular degrees of stringency will vary depending 
upon the nature of the hybridization method of choice and the composition and length ofthe 
hybridizing DNA used. Generally, the temperature of hybridization and the ionic strength (especially 
the Na+ concentration) of the hybridization buffer will determine the stringency of hybridization. 
Calculations regarding hybridization conditions required for attaining particular degrees of stringency 
are discussed by Sambrook etal. (In: Molecular Cloning: A Laboratory Manual, Cold Spring 
Harbor, New York, 1989 ch. 9 and 11). By way of illustration only, a hybridization experiment may 
be performed by hybridization of a DNA molecule to a target DNA molecule which has been 
electrophoresed in an agarose gel and transferred to a nitrocellulose membrane by Southern blotting 
(Southern, J. Mol.Biol. 98:503, 1975), a technique well known in the art and described in Sambrook 
et al. {Molecular Cloning: A Laboratory Manual, Cold Spring Harbor, New York, 1989). 

Traditional hybridization with a target nucleic acid molecule labeled with [ 32 P]-dCTP is 
generally carried out in a solution of high ionic strength such as 6 x SSC at a temperature that is 20- 
25° C below the melting temperature, T„ described below. For Southern hybridization experiments 
where the target DNA molecule on the Southern blot contains 10 ng of DNA or more, hybridization 
is typically carried out for 6-8 hours using 1-2 ng/ml radiolabeled probe (of specific activity equal to 
10* CPM/ug or greater). Following hybridization, the nitrocellulose filter is washed to remove 
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background hybridization. The washing conditions should be as stringent as possible to remove 
background hybridization but to retain a specific hybridization signal. 

The term T m represents the temperature (under defined ionic strength, pH and nucleic acid 
concentration) at which 50% of the probes complementary to the target sequence hybridize to the 
target sequence at equilftrium Because the target sequences are generally present in excess, at T m 
50% of the probes are occupied at equilibrium. The T m of such a hybrid molecule may be estimated 
from the following equation (Bolton and McCarthy, Proc. Natl. Acad. Sci. USA 48:1390, 1962): 

T m = 81.5° C - 16.6(log 10 [Na + ]) + 0.41(% G+C) - 0.63(% formamide) - (600/1) 

where 1 = the length of the hybrid in base pairs. 

This equation is valid for concentrations of Na + in the range of 0.01 M to 0.4 M, and it is 
less accurate for calculations of Tm in solutions of higher [Na^. The equation is also primarily valid 
for DNAs whose G+C content is in the range of 30% to 75%, and it applies to hybrids greater than 
100 nucleotides in length (the behavior of oligonucleotide probes is described in detail in Ch. 1 1 of 
Sambrook et al (Molecular Cloning: A Laboratory Manual, Cold Spring Harbor, New York, 1989). 

Thus, by way of example, for a 150 base pair DNA probe derived from a cDNA (with a 
hypothetical % GC of 45%), a calculation of hybridization conditions required to give particular 
stringencies may be made as follows: For this example, it is assumed that the filter will be washed in 
0.3 x SSC solution following hybridization, thereby: [Na+] = 0.045 M; %GC = 45%; Formamide 
concentration = 0; 1 = 150 base pairs; Tm=81.5 - 16.6(log 10 [Na+]) + (0.41 x 45) - (600/150); and so 
Tm = 74.4° C. 

The T m of double-stranded DNA decreases by 1-1.5° C with every 1% decrease in homology 
(Bonner et al, J. Mol. Biol. 81: 123, 1973). Therefore, for this given example, washing the filter in 
0.3 x SSC at 59.4-64.4° C will produce a stringency of hybridization equivalent to 90%; that is, DNA 
molecules with more than 1 0% sequence variation relative to the target cDNA will not hybridize. 
Alternatively, washing the hybridized filter in 0.3 x SSC at a temperature of 65.4-68.4° C will yield a 
hybridization stringency of 94%; that is, DNA molecules with more than 6% sequence variation 
relative to the target cDNA molecule will not hybridize. The above example is given entirely by way 
of theoretical illustration. It will be appreciated that other hybridization techniques may be utilized 
and that variations in experimental conditions will necessitate alternative calculations for stringency. 

Stringent conditions may be defined as those under which DNA molecules with more than 
25%, 15%, 10%, 6% or 2% sequence variation (also termed "mismatch") will not hybridize. 
Stringent conditions are sequence dependent and are different in different circumstances. Longer 
sequences hybridize specifically at higher temperatures. Generally, stringent conditions are selected 
to be about 5° C lower than the thermal melting point T m for the specific sequence at a defined ionic 
strength and pH. An example of stringent conditions is a salt concentration of at least about 0.01 to 
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1.0 M Na ion concentration (or other salts) at pH 7.0 to 8.3 and a temperature of at least about 30° C 
forshortprobes(e.£. 10 to 50 nucleotides). Stringent conditions can also be achieved with die 
addition of destabilizing agents such as formamide. For example, conditions of 5 X SSPE (750 mM 
NaCI, 50 mM Na Phosphate, 5 mM EDTA, pH 7.4) and a temperature of 25-30' C are suitable for 
5 allele-specific probe hybridizations. 

A perfectly matched probe has a sequence perfectly complementary to a particular target 
sequence. The test probe is typically perfectly complementary to a portion (subsequence) of the 
target sequence. The term "mismatch probe" refers to probes whose sequence is deliberately selected 
not to be perfectly complementary to a particular target sequence. 
1 0 Transcription levels can be quantitated absolutely or relatively. Absolute quantitation can be 

accomplished by inclusion of known concentrations of one or more target nucleic acids (for example 
control nucleic acids or with a known amount the target nucleic acids themselves) and referencing the 
hybridization intensity of unknowns with the known target nucleic acids (for example by generation 
of a standard curve). 

15 Subject: Living, multicellular vertebrate organisms, a category that includes both human 

and veterinary subjects for example, mammals, birds and primates. 

Transformed: A transformed cell is a cell into which has been introduced a nucleic acid 
molecule by molecular biology techniques. As used herein, the term transformation encompasses all 
techniques by which a nucleic acid molecule might be introduced into such a cell, including 
20 transfection with viral vectors, transformation with plasmid vectors, and introduction of naked DNA 
by electroporation, lipofection, and particle gun acceleration 

Vector: A nucleic acid molecule as introduced into a host cell, thereby producing a 
transformed host cell. A vector may include nucleic acid sequences that permit it to replicate in a 
host cell, such as an origin of replication. A vector may also include one or more selectable marker 
25 genes and other genetic elements known in the art. 

Unless otherwise explained, ah technical and scientific terms used herein have the same 
meamng as commonly understood by one of ordinary skill in the art to which this invention belongs 
The singular terms "a," "an," and "the" include plural referents unless context clearly indicates 

30 otherwise. Similarly, the word "or" is intended to include "and" unless the context clearly indicates 
otherwise. Hence "comprising A or B" means including A, or B, or A and B. It is further to be 
understood that all base sizes or amino acid sizes, and all molecular weight or molecular mass values 
given for nucleic acids or polypeptides are approximate, and are provided for description Although ' 
methods and materials similar or equivalent to those described herein can be used in the practice or 

35 testing of the present invention, suitable methods and materials are described below. All 

publications, patent applications, patents, and other references mentioned herein are incorporated by 
reference in their entirety. In case of conflict, the present specification, including explanations of 
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terms, will control. In addition, the materials, methods, and examples are illustrative only and not 
intended to be limiting. 

///. Overview, Variants of Human Taste Receptor Genes 

The inventors herein have discovered many novel polymorphic sites (polymorphisms, SNPs) 
in the T2R genes. These SNPs are listed in Figure 1. m addition, the inventors have determined the 
haplotypes for 22 of the T2R genes; these haplotypes are listed in Table 7. The haplotypes were 
identified from related individuals from the Utah Genetic Reference Project (consisting of individuals 
of Northern European ancestry), unrelated individuals from the NIH (of European, Asian African 
American, and Native American ancestry), and unrelated individuals in five different geographic 
populations, including Cameroonians, Amerindians, Japanese, Hungarians, and Pygmies 
Distributions and frequency of SNPs and haplotypes in the various populations are shown below for 
instance in Tables 2, 3, 4, and 5B (SNPs and haplotypes for T2R38), and Table 7B (haplotypes for 22 
T2R genes). Each of the identified T2R haplotypes defines a naturally occurring variant (isoforms) 
15 of the corresponding T2R gene that exists in a humanpopulation. 

Thus, in one embodiment there are provided methods, compositions and kits for genotyping 
one or more T2R gene(s) in an individual. The genotyping method comprises identifying the 
nucleotide pair that is present at one or more variant sites selected from the group listed in Figure 1 
mbothcopiesoftheselectedT2Rgene(s)fromtheindividual. Examples of such methods further ' 
compnse identifying the nucleotide pairs at all variant sites within any one T2R gene. Specific 
contemplated genotyping compositions comprise an oligonucleotide probe or primer that overlaps 
and is designed to specifically hybridize to a target region containing, or adjacent to, one of the listed 
T2R SNP sites, for instance specifically one of the SNPs that is referred to in Figure 1 as newly 
identifiedbytheinventors. A representative genotyping kit comprises one or more 
ohgonucleotide(s) designed to genotype one or more of the T2R SNP sites. Examples of such kits 
include at least one oligonucleotide designed to genotype a single T2R gene at all identified SNP sites 
(which is also useful in haplotying the individual). Other examples of such kits include at least one 
oligonucleotide designed to genotype at least one SNP within each of the 23 provided T2R genes 
One specific example is a kit that comprises at least one oligonucleotide designed to genotype each 
and every SNP described herein. The provided genotyping methods, compositions, and kits are 
useful, for instance, for identifying an individual, or collection of individuals, that has one of the 
genotypes or haplotypes described herein. 

Also provided herein are methods for haplotyping 22 T2R genes, singly or in combination 
with two or more of the set, in an individual. In examples of such methods, the method comprises 
determining, the identity of the nucleotide at one or more SNP sites (such as those listed in Figure 1) 
for one copy or both copies (also referred to as diplotyping) of the chosen T2R gene(s). In specific 
examples of such methods, it is determined whether at least one copy of at least one T2R gene in the 
mdividual's AGTR1 gene corresponds to one of the haplotypes shown in Table 7, below 
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It is specifically contemplated that more than one T2R gene can be haplotyped (or 
genotyped) in the individual. By way of example, all of the T2R genes listed herein (T2R1 T2R3 
T2R4, T2R5, T2R7, T2R8, T2R9, T2R10, T2R13, T2R14, T2R16, T2R38, T2R39, T2R40 't2R4i 
T2R43, T2R44, T2R45, T2R46, T2R47, T2R48, T2R49, T2R50, and T2R60) may be haplotyped (or 
genotyped at one or more SNP positions) in a single individual. Alternatively, any subset of T2R 
genes may be haplotyped/genotyped. 

Forexample,meha P lotypmgmemodcanbeusedtovaUdateaspecmcT2Rprotein or 
isoform (as defined by the provided haplotypes) as a candidate target for a ligand, such as a bitter 
tasting compound, or a blocker or other compound that interferes with or influences perception of 
bitter taste. Determining for a particular population the frequency of one or more of the individual 
T2R haplotypes or haplotype pairs described herein will facilitate a decision on whether to pursue it 
as a target for influencing taste perception, for instance to alter medicine, food or drink preparations 
in a way particularly suited to a given population. 

If variable T2R activity or tastant binding is associated with perception of (or failure to 
perceive) a bitter tastant, then one or more T2R haplotypes or haplotype pairs is expected to be found 
at a higher frequency in taster (or non-taster) cohorts than in appropriately genetically matched 
control individuals. This is illustrated herein with the T2R38 gene (also referred to as the PTC 
receptor). The practitioner or other individual, without a priori knowledge as to the phenotypic effect 
of any specific T2R haplotype or haplotype pair, can apply the information derived from detecting 
T2R haplotypes in an individual to decide whether modulating activity of the chosen T2R would be 
expected to be useful in influencing taste in an individual or a population. Various methods are 
provided herein for testing whether a compound or ligand interacts with a specific T2R 
isoform/variant, including ex vivo systems and in vivo systems. Some of these systems measure 
perceived taste or changes thereto directly; others measure an upstream signal for taste perception, 
such as for instance release of intraceUular calcium based on the activity of the T2R or another 
protein in the taste perception pathway. 

The provided T2R SNPs and haplotypes are also useful in screening for compounds 
targeting a T2R (or family of T2R) protein to influence a phenotype associated with the T2R isoform, 
such as perception of a taste such as a bitter taste. For example, detecting which of the T2R 
haplotypes disclosed herein are present in individual members of a target population with enables the 
practitioner or other individual to screen for a compounds) that displays the highest desired agonist 
or antagonist activity for each of the T2R isoforms present in the target population, or the most 
common isoforms present in the target population. Thus, without requiring any a priori knowledge of 
the phenotypic effect of any particular T2Rhaplotype, the provided haplotyping methods provide the 
practitioner or other individual with a tool to identify lead compounds that are more likely to show 
efficacy in influencing taste perception. 

The method for haplotyping one or more T2R gene(s) in an individual is also useful in the 
design of trials of candidate compounds for influencing perception of taste, particularly bitter taste, 
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that predxcted to be associated with T2R activity. For example, instead of randomly assigning 
obj ects to the test or control group as is typically done now, detennining which of the T2R 
ha P lotype(s) disclosed herein are present in individuals in the study enables one to select the 
drstribution of T2R haplotypes and/or sets or T2R haplotypes to test and control groups, thereby 
con*!^ 

l^Rh.plo^eorsetofhaplotypesthatbadapreviouslyu^^^ 

tastant or other hgand being studied. Thus, with the information provided herein, one can more 

^y^ontne results of me «al, wimout needing to fhst determine the specific phenotypic 
effect ofanyT2Rhaplotype or haplotype pair. 

T2* tTT PrOVideS 3 ^ « aSS ° Ciation bet — a trait and a 

^Rgenotype, haplotype, or set of haplotypes for one or more of the T2R g enes described herein 

mapo^one^tmgthe «<^~«^.f. eBW „ 1 ^ rf 
T2R .soform) wxth the frequency of the T2R genotype or haplotype in a reference population A 
higher frequency of the T2R genotype, haplotype, or set of haplotypes in the population having the 
traxt than m the reference population indicates the trait is associated with the T2R genotype 
haplotype, or set of haplotypes. In examples of such methods, the T2R SNP is selected from a SNP 

^ 1 " ^ identified by thC ™"*ors, or the T2R haplotype is selected 
from the haplotypes shown in Table 7. Such methods have applicability, for instance, in developing 
dtagnostxc tests for taste perception and development and identification of compounds useful for 
-nfluencing taste, particularly perception of bitter taste, for instance in a specific target population 
Yet another embodiment is an isolated polynucleotide comprising a nucleotide sequence 

Pearly a fragment of 1 0 or more contiguous nucleotides that overlap a SNP identified herein 
The reference sequence for each T2R gene is indicated by GenBank Accession number herein, for 

Fxgure 1 , and particularly relevant are those SNPs indicated as new in that figure Specific 

T2Rallele,wheremthenucleotidesequenceisselectedfromSEQIDNO: 3,7 9 11 13 15 19 2 , 

^25,27,29,31,33,35,39,41,45,49,55,57,59,63,65,67,09,71, 75, ^^W^» 91 
93, 95, 99, 101, 103, 105, 107, 111, „ 3 , 115 , 117 , 119> m> ^ ^ ^ ' ' 

149, 151, 155, 157, 161, 163, 169, 173, 175, 177, 179, 181, 183, 187, 189, 91, 97, 1 ^ i ' 

T2R allele vanantsoperably linked to expression regulatory elements, and recombinant host cells 
transformed or transfected with such an expression vector. The recombinant vector and host cell may 
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be used, for instance, to express a T2R isoform for protein structure analysis and compound binding 
studies, as discussed more fully herein. 

Also provided are T2R polypeptide isofonns, which comprise a polymorphic variant of a 
reference amino acid sequence for a T2R protein. The reference sequence for each T2R protein is 
mchcated by GenBank Accession number herein, for instance in the brief description of Sequence 
lasting. Polymorphisms in T2R proteins are indicated in Figure 1, and particularly relevant are those 
cSNPs indicated as new in that figure, which cause a change in the protein sequence and therefore 
result in a new T2R isoform. T2R variants are useful in studying the effect of the variation on the 
bxological activity of the T2R, as well as on the binding affinity of candidate compounds (eg. 
tastants) targeting T2R for influence perception of bitter taste. 

Also provided are T2R sequence anthologies, which are collections of T2R alleles or 
isoforms found in a selected population. The population may be any group of at least two 
mdividuals, including but not limited to a reference population, a target population, a geographic 
populate (e.g., based on continent, country, region, and so forth), a family population, a clinical 
population, and a sex-selected population. A T2R sequence anthology may comprise individual T2R 
haplotype nucleic acid molecules stored in separate containers such as tubes, separate wells of a 
microtiter plate and the like. ^dividual aUele nucleic acid molecules or isoforms, or groups of such 
molecdes, in a T2R sequence anthology, may be stored in any convenient and stable form, including 
but not limited to in buffered solutions, as DNA precipitates, freeze-dried preparations and the like 
A specific contemplated T2R sequence anthology comprises the set of haplotypes (or the encoded ' 
isoforms) shown in Table 7A. Also contemplated are anthologies that comprise subset, of T2R 
sequences, such as for instance a set of all of the haplotypes (or encoded isoforms) for a single T2R 
gene, or a set of at least one haplotype (or encoded isoform) for each T2R gene, and so forth ' 

Another embodiment provides specific binding agents, such as antibodies, that recognize 
and specifically bind to one of the variant T2R proteins described herein. 

Yet another embodiment is a nonhuman transgenic animal, comprising at least one 
polymorphic genomic T2R variant allele described herein, as well as methods for producing such 
ammals. The transgenic animals are useful for studying expression of the T2R isofonns in vivo for 
screenmg and testing of compositions targeted against the T2R protein, and for analyzing the ' 
effectless of agents and compounds for influencing taste, for instance blocking bitter taste in a 
biological system. 

particularly haplotype, data determined for T2R genes as described herein. A typical computer 
system includes a computer processing unit, a display, and a database containing the data 
Representative T2R polymorphism data includes T2R SNPs (such as those listed in Figure 1) T2R 
genotypes, T2R haplotypes (such as those listed in Table 7) and population or other information 
about the mdividuals in one or more populations. 
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IV. Representative Uses o/T2R SNPs and Haplotypes 

Identifying receptor-ligand relationships has been difficult and the nature of the ligand that 
binds to each receptor and initiates hitter taste perception is known for only a few of these receptors 
In humans, in vitro studies have shown that T2R1 6 responds to salicin and other beta- 

revealed thatT2R38 encodes the receptor for phenylthiocarbaxnide (PTC), a classic variant trait in 
human, Distinct phenotypes have been clearly associated only with specific haplotypes of the PTC 
receptor and there are now five SNPs described corresponding to seven haplotypes, including taster 
non-taster and intermediate alleles (see, e. g ., PCT/US02/23 172, published as WO 03/008627, which 

10 .mcorporatedhereinbyreference). The non-taster allele may encode an isoform that serves as a 
funchonal receptor for another as yet unidentified toxic bitter substance. T2R3 8 (PTC) studies 
suggest that there may be substantial additional complexity in the task of identifying specific ligands 
for each bitter taste receptor, as different alleles of each gene may encode receptors that recognize 
Afferent ligands. However merely identifying in isolation the many different DNA variants (SNPs) 

15 in these genes is less useful as these variants could possibly exist in a huge number of different 

genetically-linked combinations (haplotypes) able to encode a correspondingly huge number of bitter 
taste receptors expressed on the surface of the tongue. (The number of different possible proteins 
mcreases as the number (N) of different cSNPs (SNPs able to give rise to changes in amino acid 
sequence) in the coding sequence of the gene by 2», where N is the number of different cSNPs in the 
20 coding sequence of the gene.) 

mdwidual sequence variants (SNPs) in isolation do not determine the receptor protein produced in a 
cell. For example, the three variant sites in T2R38 are capable of producing eight different protein 
isoforms, depending on the combination of variant forms present at each of the three sites (2> = eight 
25 potentialhaplotypesequences). h^tho.^^,^^^,^,,,^ 

duee possxble haplotypes do not exist in any population worldwide insofar as we can determine This 
becomes particularly important when a gene contains many coding sequence variants, such as T2R49 
There are eleven different coding SNP's in this gene, which could occur together in over 2000 
possible combinations, potentially producing over 2000 different forms of the T2R49 receptor 

analyzed, we have identified a total of 109 differ™* nmfom a- u i 

isoformsoftheT2Rp ro teins. °^ 109 different protein coding haplotypes, and thus 109 different 

It is believed that different T2R haplotypes encode receptor isoforms with different chemical 
>5 species for bitter tastants/ligands, analogous to the situation we have shown exists for T2R38 
Efforts are currendy ongoing worldwide to identify bitter tastants/ligands for each receptor The " 
results presented herein indicate that this cannot be viewed as an effort to decode ligands for only 22 
or23 different T2R gene, Instead the real nature of the problem is to decode the ligand(s) for each 
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of at least 109 different haplotypes. Experiments to identify ligand(s) (or blockers) for all possible 
haplotypes (which number many thousands, counting all 23 T2R genes together) are not practicable 
with current technologies, nor are they necessary if the vast majority of possible haplotypes/isoforms 
do not in fact occur in nature. Our information reveals the subset of possible haplotype sequences 
5 that are actually present in humans, and are thus worthy of further study. This will enable more rapid 
and efficient de-orphanizing of each of the taste receptors, as we have done for T2R38. 

Also reported herein are T2R haplotype frequencies in each of several populations, 
including Europeans, East Asians, and Africans. This information can be used to design foods and 
beverages for different worldwide markets, in two ways. First, in food and beverage research and 
10 development, population-specific haplotype distribution information will allow the selection of 
panels of taste sensors in a rational and efficient manner. This information will also be useful in 
either pure in vitro systems, or in panels of human volunteer tasters, who can be genotyped or 
selected using these discoveries. Second, knowledge of the genetic underpinnings in individual taste 
preferences in target populations will provide powerful predictive information for food and beverage 
15 palatabihty in different populations. Thus population-specific, and indeed even region-specific, 

anthologies or databases are now able to be developed using this information; these can provide T2R 
haplotype frequencies in regional or local populations. These resources can be used to improve both 
development and marketing decisions in the flavorings, food, and beverages industries. 

Also provided based on the discoveries described herein are methods and devices for high 
throughput analysis of T2R genotype and/or phenotype in an individual or group of individuals. A 
specific example of such a high throughput device is a DNA or protein microarray, which contains a 
collection of two or more T2R alleles or SNP-specific oligonucleotides (in the case of a DNA 
microarray) or isoform proteins or variant fragments thereof (in the case of a protein microarray). 
Examples of such arrays of molecules include at least one molecule representing each of the 109 
haplotypes listed in Table 7A. Specific example arrays include at least two sequences selected from 
SEQ ID NOs: 3, 7, 9, 11, 13, 15, 19, 21, 23, 25, 27, 29, 31, 33, 35, 39, 41, 45, 49, 55, 57, 59, 63, 65, 
67, 69, 71, 75, 77, 79, 81, 83, 85, 89, 91, 93, 95, 99, 101, 103, 105, 107, 111, 113, 115, 117, 119, 121, 
123, 125, 127, 129, 135, 139, 141, 147, 149, 151, 155, 157, 161, 163, 169, 173, 175, 177, 179, 181, 
183, 187, 189, 191, 197, 199, 201, 203, 205, 209, 211, 213, 217, 219, 225, 227, 229, 231, 233,' 23 5 ] 
237, 241, 243, 245, 247, 249, 251, 253, 257, 259, and 263, or an oligonucleotide comprising at least 6 
or at least 10 contiguous nucleotides selected from one of these sequences and which oligonucleotide 
overlaps at least one SNP as listed in Figure 1. Other specific example arrays include at least five 
such sequences, at least 10, at least 20, at least 30, at least 50 or more, including for instance all 81 of 
the following: SEQ ID NOs: 49, 55, 57, 59, 63, 65, 67, 69, 71, 75, 77, 79, 81, 83, 85, 89, 91, 93, 95, 
99, 101, 103, 105, 107, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 135, 139, 141, 147, 149, ' 
151, 155, 157, 161, 163, 169, 173, 175, 177, 179, 181, 183, 187, 189, 191, 197, 199,201,203,205, 
209, 211, 213, 217, 219, 225, 227, 229, 231, 233, 235, 237, 241, 243, 245, 247, 249, 251, 253, 257^ 
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259, and 263, or an oligonucleotide fragment of each of these sequences which oligonucleotide 
overlaps at least one SNP form the sequence. 

By way of example, such arrays can be used in genotyping and haplotyping of individuals or 
groups of individuals. In certain embodiments, the results from such genotypmg/haplotyping is used 
to select a cohort of individuals of known genotype/haplotype for at least one T2R receptor (or a 
combmation of two or more T2R receptors, or all T2R receptors). These individuals could then be 
framed (as necessary) and used in flavor panel evaluation. Because the population of taste evaluators 
are of known (or partially known) genotype as relates to T2R receptor(s), a relatively small panel of 
tasters can provide results that can be extrapolated out to a large (e.g. , commercially relevant) 
population. Such large population is beneficially characterized by the frequency of occurrence of 
specific T2R isoforms/haplotypes, so that panels can be matched to the expected taste preference^) 
of the population. The teachings herein enable such methods of extrapolating me bitter taste receptor 
haplotype from a small group to a large (commerciaUy relevant) population, thus representing a 
savmgs in time and cost. Such an approach could be used, for instance, for "deorphanising" T2R 
receptors for specific bitter tastes/tastants or combinations thereof, for evaluating likely population 
response to tastante or blockers, or to characterize or develop new tastant molecules or blockers This 
would allow decisions about population-specific taste variation to aid decisions about worldwide 
marketing of specific flavorings and food and beverage products. 

Also contemplated are in vitro biochemical functional assays of T2R taste receptor function 
Such studies employ a variety of different assays, which produce information about G protein 
activation upon binding of tastant ligands to T2R rectors. One long term goal of such studies is the 
development of an "artificial tongue" that could be used to perform taste tests without the 
intervention of living humans as taste sensors. 

25 V. Ex vivo uses ofT2R bitter receptor haplotype-specific isoforms 

These haplotypes can be used to make protein expression constructions and generate 1 09 
uruque T2R bitter receptor proteins. These proteins can be arranged in a battery or an array to create 
a group of sensors for bitter tastant* ligands. Such an array could be employed in large parallel high- 
throughput systems, that would allow the testing of the effects of bitter tastant ligands on all forms of 

30 all receptors without the intervention of human tasters. 

These expressed isoform receptors can be Used in ex vivo reporter assays of several types 
One type is exemplified in the publication of Adler et al. Cell 100:693 (2000) (incorporated herein by 
reference in its entirety). The method employs calcium-sensitive dyes to assay the release of calcium 
from mtracellular stores in response to G protein activation by ligand binding to the expressed T2R 

35 receptor protein. Another contemplated method employs direct measurement of G protein activation 
by binding of a radioactive, nonhydrolyzable analog of OTP in a cell-free reconstituted system 
containing G proteins, T2R receptor, and ligand, as described by Sainz et al. Abstracts of the XXVI 
Meeting of the Association for Chemoreception Sciences, 211:55, 2004 (incorporated herein by 
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reference). Either of these system can be used, for instance in an array-based format, to identify or 
develop ligands that interact with T2R isoforms or sets of isofonns, or with specific T2R genes as 
wellas to identifyagentsthatinfluence the binding of suchligands. For instance, agents that reduce 
<«*. block) the binding of a ligand to a specific T2R isoform (or set thereof), or that compete with 
the binding of a known ligand, can be identified by a reduction in signal in a calcium-sensitive dye 
system, or by the reduction in binding of the radioactive GTP analog. Agents that increase or 
enhance the binding of a ligand can be identified by increased signals in either system. 

VI. Overview of Several Specific Embodiments 

10 Encompassed herein are isolated T2R variant-specific nucleic acid molecules, each of which 

comprise at least about 10 contiguous nucleotides that span (that is, include) at least one SNP 
adentified as new in Figure 1 . Also provided are arrays, which comprising two or more such nucleic 
add molecules. By way of example, such arrays can comprise at least one nucleic acid molecule 
compnsmg at least about 10 contiguous nucleotides from T2R1, T2R3, T2R4 T2R5 T2R7 T2R8 

15 T2R9, T2R10, T2R13, T2R14, T2R16, T2R38, T2R39, T2R40, T2R41, T2R43, T2R44 T2R45 ' 
T2R46, T2R47, T2R48, T2R49, T2R50, and T2R60, and spanning at least one SNP identified as new 
in Figure 1. Other examples of the array comprise at least one oligonucleotide from each T2R 
haplotype/aUele listed in Table 7. Specific examples of the arrays are in the format of microarrays 
Also provided are collections of two of more isolated T2R variant-specific nucleic acid 
20 ™^(-omerwords,spe^^ 

each nucleic acid molecule in the collection comprising at least about 1 0 contiguous nucleotides ' 
spanning at least one T2R SNP position listed in Table 7. Examples of such collections comprise at 

leastoneisolatedT2Rvariant-specmcnucleicacidmoleculefromT2Rl,T2R3,T2R4 T2R5 T2R7 
T2R8, T2R9, T2R10, T2R13, T2R14, T2R16, T2R38, T2R39, T2R40, T2R41, T2R4 3 't2R44 ' 

25 T2R46,T2R47,T2R48,T2R49,T2R50,andT2R60. Other collections comprise at least one isolated 
Vanant_SpeCifiC nucleic acid molecule &™ every SNP listed in Table 7. Still other collections 
comprise at least one isolated T2R variant-specific nucleic acid molecule from each of SEQ ID NO- 
49, 55, 57, 59, 63, 65, 67, 69, 71, 75, 77, 79, 81, 83, 85, 89, 91, 93, 95, 99, 101, 103, 105, 107 111 " 
113, 115, 117, 119, 121, 123, 125, 127, 129, 135, 139, 141, 147, 149, 151, 155, 157, 161, 163,' 169,' 

30 173, 175, 177, 179, 181, 183, 187, 189, 191, 197, 199, 201, 203, 205, 209, 21 1, 213, 217, 219 225 

227,229,231,233,235,237,241,243,245,247,249,251,253,257,259,^263. For instance; 
examples of such collections, ihe isolated T2R variant-specific nucleic acid molecules have a 
sequence as shown in SEQ ID NO: 49, 55, 57, 59, 63, 65, 67, 69, 71, 75, 77, 79, 81, 83, 85, 89 91 
93, 95, 99, 101, 103, 105, 107, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 135, 139, 14l' 147 
* 149, 151, 155, 157, 161, 163, 169, 173, 175, 177, 179, 181, 183, 187, 189, 191, 197, 199, 201 2 3 

257 259 21 263 13, 219 ' 225 ' 227 ' ^ 231 ' 237 ' 241 ' ^ 247 > 249 ' 253 ^ 
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Optionally, in collections provided herein, each nucleic acid molecule is stored in a separate 
container. For instance, the separate containers in some embodiments are wells of a microtiter plate 
or equivalent thereof. In other embodiments, the nucleic acid molecules of the collections are affixed 
to a solid surface in an array, such as a microarray. 

In one embodiment, the microarray collection comprises nucleic acid molecules having the 
sequence as set for in SEQ ID NO: 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 

81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 
127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 
167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, 
207, 209, 211, 213, 215, 217, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 
247, 249, 251, 253, 255, 257, 259, 261, and 263. 

Other specific contemplated collections contain isolated T2R variant-specific nucleic acid 
molecules taken from a single T2R gene. For instance, collections are contemplated wherein the 
molecules comprise : (a) SEQ ID NOs: 47, 49, and 51; (b) SEQ ID NOs: 53 and 55; (c) SEQ ID 
NOs: 57, 59, 61 , 63, 65, 67, 69, and 71; (d) SEQ ID NOs: 73, 75, 77, 79, 81, 83, and 85; (e) SEQ ID 
NOs: 87, 89, 91, 93, and 95; (f) SEQ ID NOs: 97, 99, 101, 103, 105, and 107;(g) SEQ ID NOs: 109, 
111, 113, 115, 117, 119, 121, and 123; (h) SEQ ID NOs: 125, 127, 129, and 131; (i) SEQ ID NOs: ' 
133 and 135; (j) SEQ ID NOs: 137, 139, and 141; (k) SEQ ID NOs: 143, 145, 147, 149, and 151; (1) 
SEQ ID NOs: 153, 155, 157, 159, 161, 163, and 165; (m) SEQ ID NOs: 167 and 169; (n) SEQ ID 
NOs: 171, 173, 175, and 179; (o) SEQ ID NOs: 181, 183, and 185; (p) SEQ ID NOs: 187, 189, 191, 
193, 195, 197, and 199; (q) SEQ ID NOs: 201, 203, 205, 207, 209, and 211; (r) SEQ ID NOs: 213, ' 
215, 217, and 219; (s) SEQ ID NOs: 221, 223, 225, 227, 229, 231, 233, 235, and 237; (t) SEQ ID ' 
NOs: 239, 241, 243, 245, 247, 249 and 251; (u) SEQ ID NOs: 253, 255, 257, and 259; (v) SEQ ID 
NOs: 261 and 263; or (w) a combination of two or more of (a) through (v). 

Also provided are isolated T2R polypeptide isoform fragment, such as polypeptide 
fragments encoded by an isolated T2R variant-specific nucleic acid molecule that comprises at least 
about 10 contiguous nucleotides that span (that is, include) at least one SNP identified as new in 
Figure 1. . 

Another embodiment is an isolated T2R isoform polypeptide fragment comprising an amino 
acid sequence comprising at least 10 contiguous amino acids of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 
16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 50, 56, 58, 60, 64, 66, 68,' 70, 72, 76, 78, 80, 

82, 84, 86, 90, 92, 94, 96, 100, 102, 104, 106, 108, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130^ 
136, 140, 142, 148, 150, 152, 156, 158, 162, 164, 170, 174, 176, 178, 180, 182, 184, 188, 190, 192, ' 
198, 200, 202, 204, 206, 210, 212, 214, 218, 220, 226, 228, 230, 232, 234, 236, 238, 242^ 244^ 246^ 

35 248, 250, 252, 254, 258, 260, or 264, which fragment includes at least one amino acid variation as let 
forth in Figure 1 or Table 7. 

Also provided are isolated T2R polypeptide isoforms, which comprise an amino acid 
sequence selected from SEQ ID NO: 50, 56, 58, 60, 64, 66, 68, 70, 72, 76, 78, 80, 82, 84, 86, 90, 92, 
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9 9 0, 102 , 104, 106, 108, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 136, 140, 142, 148, 
150, 152, 156, 158, 162, 164, 170, 174, 176, 178, 180, 182, 184, 188, 190, 192, 198, 200, 202 204 
206, 210, 212, 214, 218, 220, 226, 228, 230, 232, 234, 236, 238, 242, 244, 246, 248, 250 252 254' 
258, 260, or 264, and isolated nucleic acid molecules encoding such T2R polypeptide isoforms 
vectors comprising one of the isolated nucleic acid molecules, and host cells comprising such vectors 

Yet further embodiments are isolated nucleic acid molecules comprising a nucleotide 
sequence for a T2R allele, wherein the nucleotide sequence is selected from SEQ ID NO- 49 55 57 
59, 63, 65, 67, 69, 71, 75, 77, 79, 81, 83, 85, 89, 91, 93, 95, 99, 101, 103, 105, 107, 111 113 'l!5 ' 
117, 119, 121, 123, 125, 127, 129, 135, 139, 141, 147, 149, 151, 155, 157, 161, 163, 169, 173 175 
10 177, 179, 181, 183, 187, 189, 191, 197, 199, 201, 203, 205, 209, 211, 213, 217, 219, 225 227 229 
231, 233, 235, 237, 241, 243, 245, 247, 249, 251, 253, 257, 259, or 263, vectors comprising one of' 
the isolated nucleic acid molecules, and host cells comprising such vectors. 

A method of screening compounds useful for modulating bitter taste is also provided Such 
methods comprise contacting a test compound with a host cell (such as a eukaryotic cell, for instance 
a HEK293 cell) or membrane thereof that expresses a T2R taste receptor isoform encoded by an 
asolated nucleic acid molecule described herein; and detecting a change in the expression of the 
nucleotide sequence or a change in activity of the T2R taste receptor, or detecting binding of the 
compound to the T2R taste receptor or detecting a change in the electrical activity of the host cell or a 
change m intraceUular or extracellular cAMP, cGMP, IP3, or Ca 2+ of the host cell. In certain 

20 embodiments, the gene product of said nucleotide sequence is fused to a sequence that facilitates 

localization to the cell membrane, wherein that sequence is at least 20 consecutive N terminal amino 
acds of a rhodopsin protein. In examples of the screening methods, a change in intracellular Ca 2+ is 
detected by measuring a change in a calcium-sensitive dye dependent fluorescence in the cell In a 
preferred embodiment, a change in intraceUular Ca 2+ is detected by measuring a change in Fura-2 

25 fluorescence in the cell. 

Another example of such a screening method is a high throughput method, which method 
composes: contacting in parallel a test compound with a collection of host cells or membranes 
thereof each of which expresses a different T2R taste receptor isoform encoded by an isolated nucleic 
acrd molecule comprising a nucleotide sequence for a T2R allele, wherein the nucleotide sequence is 
selected from SEQ ID NO: 49, 55, 57, 59, 63, 65, 67, 69, 71, 75, 77, 79, 81, 83, 85, 89, 91 93 95 

151, 155, 157, 161, 163, 169, 173, 175, 177, 179, 181, 183, 187, 189, 191, 197, 199, 201, 20 205 

259, and 263; and detecting a change in the expression of at least one of the nucleotide sequences or a 
change in activity of at least one of the T2R taste receptors, or detecting binding of the compound to 
at least one of the T2R taste receptors or detecting a change in the electrical activity of at least one of 
the host cells or a change in intraceUular or extraceUular cAMP, cGMP, n>3, or Ca* of at least one of 
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haplotypes, demonstrating the direct influence of this gene on PTC taste sensitivity, and that variant 
sites interact with each other within the encoded gene product. 

Methods and Materials: 

PTC phenotype determinations. Subjects began tasting a solution of 1 micromolar PTC 
(solution #14) and proceeded in 2-fold increasing concentration increments (solutions 13, 12, 11...) 
until a bitter taste was perceived. Subjects then performed a blinded sorting test containing 3 cups of 
PTC solution and 3 cups of water. Raw taste threshold was the most dilute solution at which the 
subject could correcfly sort all 6 cups. We also included a quinine threshold measurement according 
to Blakeslee & Salmon (Proa. Natl. Acad. Sci. USA 21, 84,1935) to identify and exclude individuals 
with general deficits in bitter taste (aguesia). For dichotomous assignment of phenotype, we 
considered individuals unable to taste PTC before solution #6, i.e. at concentrations less than 267 
micromolar PTC, to be non-tasters. Although the classic method includes corrections for age and 
sex, analysis of our raw PTC taste threshold data indicated only a modest sex effect, with females 
more sensitive than males (p = 0.00324, proportion of variance explained - 5.1%). No effect of age 
on PTC scores was observed. As a result, raw PTC threshold scores were used for all analyses. 

Research subjects. The Utah C.E.P.H. families were enrolled in conjunction with the Utah 
Genetic Reference Project under University of Utah IRB approved protocol #6090-96, and consisted 
of individuals of Northern European ancestry. Subjects in the NIH replication sample were enrolled 
under NTH/NINDS IRB approved protocol # DC-01-230, and were of European, Asian, African 
American, and Native American ancestry. Human Diversity Panel DNAs (sub-Saharan African, 
Asian, and Southwest Native American) and primate DNAs were obtained from the Coriell Cell 
Repository, Camden, NJ. The Utah sample consists of 27 families comprising 269 individuals; bom 
haplotype and phenotype information was available for 180 of these individuals. The NIH replication 
sample of consisted of 85 unrelated individuals of known haplotype and phenotype; 51 were 
European, 5 Pakistani, 23 East Asian, and 6 African-American. One African-American is not 
considered in the analysis due to a rare AAV/AAI diplotype. His raw PTC score is 7. 

Bioinformatics analyses. Bioinformatics analysis was performed with the NCBI Human 
Genome databases (available on the Web at ncbijilm.nih.gov/genome/guide/human) and the Celera 
Discovery System (http://cds.celera.com/cds). Gene finding was performed with BLASTX (available 
on the Web at ncbi.nlm.nih.gov/BLAST) and GENESCAN and FGENES software (GeneMachine, 
DIR, NIH, ht^://genome.nhgri.nih.gov/genemachineO. SNPs were developed using the SNP 
database (available on the Web at ncbi.nhn.nih.gov/SNP/ ) 

PTC gene haplotyping. Haplotypes within the PTC gene were determined by performing 
genomic PCR to obtain a 1 195 bp product containing all 3 variant sites, using primers as follows: F = 
5' GCTTTGTGAGGAATCAGAGTTGT 3>, R = 5' GAACGTACATTTACCTTTCTGCACT 3'. 
The mass PCR product from each individual was cloned into TopoTA vector (Clonetech), and single 
colonies which contained a single amplified haplotype were picked and sequenced. 
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QTL linkage analysis. Quantitative trait linkage analysis was performed using SOLAR 
Almassy and Blangero, Am. J. Hum. Genet. 62, 1 198, 1998. The effect of PTC haplotypes on the ' 
linkage results was determined by performing two multipoint linkage analyses: one using the raw 
PTC scores and another using adjusted PTC scores, both with sex as a covariate. The first analysis 
excluded diplotypes as covanates, the second included them. For the latter, adjusted scores were 
obtained by subtracting off the mean of each diplotype group from the scores of individuals with that 
particular diplotype. 

Haplotype effect analysis. The effect of the PTC haplotypes, as well as the covanates sex 
and age, on raw PTC scores was estimated simultaneously in a multivariate analysis using the 
program SOLAR 26 . SOLAR estimates the proportion of variance explained by a covariate (eg., the 
PTC diplotype) in the presence of background polygenic variance, in this case estimated from ' 
residual familial correlation in the phenotype. The program also takes into account non- 
independence of sib genotypes. The confirmation sample of unrelated individuals was analyzed 
using multiple linear regression with sex and age as covariates as well as Analysis of Variance. 

GenBank. Human candidate taste receptor gene TAS2R38 (GenBank accession number 
AF494231) is identical to the sequence of the non-taster AVI form of the PTC gene, with the 
exception of nucleotide 557, which is an A (encoding Asn 186 ) in TAS2R38 and a T (encoding He 186 ) 
inPTC. 

Material in this example was published as Kim et al, Science 299:1221-1225, February 21 
2003, which publication is incorporated herein by reference in its entirety, including the supplemental 
matenal published on-line at scienemag.org/cgi/content/fiul/299/5610/121/DCl. 

Results and Discussion 

The inability to taste PTC {Science 73:4, 1931; Quo and Reed, Ann. Hum. Biol. 28:11 1, 
2001) was long believed to be a simple Mendelian recessive trait (Snyder, Science 74:151 193l' 
Levit and Soboleva, J. Genetics 30:389, 1935; Blakeslee, Proc. Acad. Natl. Acad. Sci. USA 18120 
1932; Lee, Ohio J. Science 34:337, 1934; Harris and Kalmus, Ann. Eugenics, London 15:24, 1949)' 
Over time however, many reports emerged which contradicted this model (Falconer, Ann. Eugenics 
13:211, 1946-47; ReddyandRao, Genet. Epidemiol. 6:413, 1989; Olsons al, Genet. Epidemiol 
6:423, 1989). Linkage studies have been equally conflicting. Initial studies provided very strong 
support for linkage to the KEL blood group antigen (later determined to reside on chromosome 7q3) 
(Chautard-Freire-Maia et al, Ann. Hum. Ge, ie t. 38:191, 1974; Conneally et al, Hum. Hered. 26:267 
1976), but other studies failed to provide significant support for this linkage (Spence et al, Hum. 
Genet 67:183, 1984). The only genome-wide linkage survey was performed with the related 
compound propyl-thiouracil. This study produced evidence for linkage to loci on chromosome 5p, 
and a suggestion of linkage to markers on chromosome 7q31, at a distance of -35 cM from KEL ' 
(Reed et al, Am. J. Hum. Genet. 64:1478, 1999). 
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Matenals and methods are available as supporting material on Science Or^^^^ 
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Table 3. Haplotype association with taste phenotypes 



Haplotypes Sample No. of subjects 



AVI /AVI 


Utah 


38 


x asters 
14 


NIH 


21 


0 


AVI /AAV 


Utah 


10 


7 




NIH 


1 


3 


*/PAV 


Utah 


3 


108 
58 


rtype found in the i 


NIH 

samnlp Mn 


1 
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subsets of the Utah sample. In the subgroups which were large enough to give accurate estimates 
heritability was 0.26 ±0.19 (83 subjects in 20 families) in the PAV/AVI subgroup, and 0 50 ± 0 33 in 
the AWAVI subgroup (46 subjects in 17 families). The increase in heritability in the loss of 
function diplotype group (AVI/AVI) indicates that there may be other genetic factors that interact 
w 1 mPTCandcanrestoresomemeasureoftastesensinvityinthisgroup. For Caucasians and East ' 
Asxans, our results are largely consistent with a model of a major recessive QTL modified either by a 
polygenic (Reddy and Rao, Genet. Epidemiol. 6:413, 1989) or single locus (Olson et al, Genet 
Epidemiol. 6:423, 1989) residual background effect. 

Due to the high frequency of the PAV and AVI haplotypes in the population, we sought to 
determine which haplotype represents the original form of the PTC gene. We sequenced this gene in 
6 pnmate species: humans and one individual each from chimpanzee, lowland gorilla, orangutan 
crab-eating macaque (an old world monkey), and black-handed spider monkey (a new world 
monkey), representing over 25 million years of evolutionary divergence. All of the non-human 
pnmates were homozygous for the PAV form, indicating that the AVI form arose in humans after the 
15 time they diverged from the nearest common primate ancestors. 

Five different haplotypes were observed worldwide (Table 4). In Europeans and Asians the 
taster haplotype PAV and the non-taster haplotype AVI make up the vast majority of haplotypes ' 
present. Two additional haplotypes, PVI and AAI, were observed only in individuals of sub-Saharan 
African ancestry, consistent with other reports of increased gene haplotype diversity in this 
population (Stephens et al, Science 293:489, 2001). The common non-taster AVI haplotype was 
observed in all populations except Southwest Native Americans, who were exclusively homozygous 
for the PAV haplotype, consistent with the reported low frequency of non-tasters in this population 
(Guo and Reed, Ann. Hum. Biol. 28:111, 2001). Thus overall, the worldwide distribution of these 
haplotypes is consistent with the large anthropologic literature on the distribution of this phenotype 
(Boyd, "Genetics & the Races of Man. An introduction to modem physical anthropology » Little 
Brown and Company, Boston, 1950; Tills et al, "The Distribution of Human Blood Groups and other 
Polymorphisms," Supplement, 1 st Edition. Oxford University Press, Oxford, 1983) 
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Table 4. Frequency of PTC Ge ne Haplotypes in Popu lations Worldwide 

Haplotype 
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Wee «, nonresrer-phenotypesw^. These tapiotypes hnd airnnar fre^L, 

20 ^;°°^" C ^ M ™ tt »^^-^8«»». In addition, Tajnna.sD^ 
20 Fn and L, . D and P atabatioa dereonstiuled a sigrerrean, deviation ftom ueutrebty bec^se of an 

< ,0 » The* results corebin. re suggest tha, baling natural selection has acred re retrain 
tastef and ^ontaster" alleles at die PTC locus in humans. 

25 Methods 

° f ^ deSOTW ' bOT ^ tate »°*»^-„f ( ^ce2 9 0i, 221 . I22 5 
2003, tred Drayna „ ul. (Hlm Oe»„ 1 ,2:567-572, 2003) (bore of wU cb are recurred ^ by 

30 ^T^^^'^^^^^wingdescenls: Afrioan (, sub^ahal 
Japanese, 12 Koreans, 7 Middle Easterns, .OP^ lOofcer Somber a^,, Europ _ (10 

( OSourewestNative Anrerioana). Porcoreparison, seances were a,so „b«Id tal 
chmpareree -Pan troglodyKs) and one gorilla (Gorilla gorilla) 

" two „ .^'^^'^^^^""^'eebnirpres. insucbtadividuals.fte 
two ahehc versrons of tire gene were olou.d as singre PCR produce srre ^dividual clones were 
sequenced » reveal bod. haplotypes. phyfc^ ^ 

^^^-io^^^PHVUPso^pao^tp^^^l 
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which was rooted using the gorilla sequence, was then used to determine the polarity of character 
states. Evolutionary relationships among haplotypes were visualized using a minimum spanning tree 
generated by the ARLEQU1N computer program (Schneider et al, "ARLEQUIN version 2.000: a 
software for population genetics data analysis. Genetics and Biometry Laboratory, Department of 
Anthropology, University of Geneva, Geneva, Switzerland 2000). 

Tajima's D (Tajima, Genetics 123:585-595, 1989) and Fu and Li's D and F statistics (Fu 
and Li, Genetics 133:693-709, 1993) were used to test the hypothesis that patterns of diversity in 
humans are consistent with the hypothesis of neutrality. To avoid confusion, we refer to Tajima's D 
as "DT» and Fu and Li's D as "DF." These tests were performed by simulating 10,000 gene 
genealogies and comparing statistics obtained from the simulations with the observed statistic, as 
described by Tajima {Genetics 123:585-595, 1989) and Fu and Li (Genetics 133:693-709, 1993). To 
incorporate varying assumptions about population size change in human populations, these 
simulations were performed using the algorithm of Rogers (Evolution 49:608-615, 1995). This 
algorithm assumes that human population sizes increased suddenly from an ancient population size 
(NO) to a larger population size (Nl), t generations ago, with infinite-sites mutation rate. Patterns of 
genetic diversity produced under these conditions approximate those produced under more 
complicated conditions, such as exponential and logistic growth (Wooding & Rogers, Genetics 
161:1641-1650, 2002). 

Tests for excesses of synonymous and nonsynonymous nucleotide substitutions were 
performed using the methods of McDonald and Kreitman (Nature 351:652-654, 1991) and Li etal 
(Mol Bio! Evol 2:150-174, 19SS). The McDonald-Kreitman test (McDonald & Kreitman, Nature 
351:652-654, 1991) uses a Fisher's exact test to determine whether the ratio of synonymous and 
nonsynonymous substitutions differs between two categories: polymorphisms that are variable within 
species and polymorphisms that distinguish species (i.e., fixed differences). We used the McDonald- 
Kreitman test to examine polymorphisms found in humans and chimpanzees. The KA/KS test 
determines whether there is an overall excess of synonymous or nonsynonymous nucleotide 
substitutions (Li et al., Mol Biol Evol 2:150-174, 1985). 

Tests for genetic differentiation between populations were performed using Slatkin's 
linearized FST statistic (Station, Genet Res 58:167-175, 1991). The statistical significance of these 
values was assessed using the bootstrap method of Excoffier et al. (Excoffier et al, Genetics 
131:479-491, 1992), in which observed values are compared with FST values simulated by randomly 
allocating chromosomes to different populations. These tests used 10,000 bootstrap replications. 

Results and Discussion 

DNA sequencing revealed five variable nucleotides in humans, as detailed in Example 1 . 
These variants were partitioned into seven haplotypes. The chimpanzee and gorilla were both 
homozygous at all nucleotide positions and thus carried one haplotype each. Human and chimpanzee 
sequences differed by an average of 8.3 nucleotides, as did human and gorilla sequences. The 
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chnnpanzee and gorilla sequences differed by six nucleotides. !n Table 5A, each haplotype is 
surnmarized in two rows. The top row sunnnarizes nucleotide variation in the haplot^e, and the 
bottom row summarizes amino acid variation in the haplotype. Each column represents a codon 
contanung a variable nucleotide position, which is indicated at the top of the column Shaded 
cohunns indicate the three variable ann,o acid positions used for haplotvpe designation m Example 1 
andKun**, (^299:1221-1225,2003). The number of occurrences of each haplotvpe is 
^catedin Table SB, for me AMcan (Af), Asian (As), European (Eu), and North AmericaT(NA) 
sample, Haplotvpe counts are not given for the chimpanzee and gorilla haplotypes (ptA and ggA 
respectively), which were each observed twice. ' 

Table 5A: Variable Nucleotide Positions in PTC Haplotypes. 



t 



hsA 

hsB 

hsC 

hsD 

hsE 
hsF 
hsG 

ptA 
ggA 



2 
3 

CGC 
-R- 

CGC 
-R- 

CGC 
-R- 

CGC 
-R- 



n 



2 
3 
9 



CCA 

GCA 
• -A- • 

vGCA 



-R- u^ : 

cgc ! JdCA 
-R- H'jyfc. 
CGC j <3CA 

-R- j -A- : 

I 

CAC ! CCA 
CGC CCA 



-H- 
CGC 
-T- 

CAC 
-H- 

CAC 
-H- 

CAC 
-H- 

CAC 
-H~ 

CAC 

-H- 
CAC 

-H- 



5 


6 


6 


7 


7 


9 


9 


3 


0 


2 


7 


2 


AAT 
-N- 


ACC 


AAC 


AAA 


-T- 


-N- 


-K- 


AAT 


ACC 


AAC 


AAA 


-N- 


-T- 


-N- 


-K- 


AAT 


ACC 


AAC 


AAA 


-N- 


-T- 


-N- 


-K- 


AAT 


ACC 


AAC 


AAA 


-N- 


-T- 


-N- 


-Kr 


AAT 


ACC 


AAC 


AAA 


-N- 


-T- 


-N- 


-K- 


AAT 


ACC 


AAC 


AAA 


-N- 


-T- 


-N- 


-K- 


AAT 


ACC 


AAC 


AAA 


-N- 


-T- 


-N- 


-K- 


AAC 


ACC 


GAC 


AAG 


-N- 


-T- 


-D- 


-K- 


AAC 


ATC 


GAC 


AAA 


-N- 


-I- 


-D- 


-K- 



3 ,vs 



8 8 
2 2 
- 0 2 

5 OCT 5 CGC 



-A- 
OCT 



-R- 

CGC 
-R- 

CGC 
-R- 

CGC 



-A- -R- 

OCT 'j CGC 



-A- 
OCT 
-A*" 
OTT 
-V- 



GCt 
; ~A- 



-R- 
TGC 

-C- 
CGC 

-R- 



CGT 
-R- 



OCT i CGC 

*- ; *A- '! -R- 



1: SEQ ID NO: (NA/AA) - Sequen^e'lD nu mber for Nucleoid ^0 ^5: 
Table SB: Occurrences of Variable Nucleotides in Human PTC Haplotypes. 



. 

i- '6': : 


] 99 
j 22 
^ 67 


9 
9 
5 


SEQ ID 
NO 
(NA/AAV: 


1 GTC 


: ATG 


CTG 






j -M- 


-Lr 






j 




153/154 




I ATG 


CTG 




-I- 


-M- 


-Lr 


155/156 


ATC 


ATG 


CTG 




-M- 












157/158 


ATC - 


ATG 


CTG 


"' -1- • 

•. •• \y 


-M- 


-L- 








159/160 


cine 

*-v-li- 


ATG 


CTG 




-M- 


-L- 


161/162 


Arc . 


ATG 


CTG 




+ i 


-M- 


-Lr 


163/164 


ATC ; 


ATG 


CTG 




* ! 


-M- 


-L- 


165/166 


qtc 


ACG 


CTG 


N/A 




-T- 


-L- 




GTC 


ACA 


CCG 


N/A 


-v- i 


-T- 


-P- 



15 



Af As Eu NA Total 

« hsA 38 76 51 19 184 



f hsB 

W hsE 
hsF 



1 
1 

9 
2 
1 



0 
0 
1 
0 
0 



hsG 10 61 



0 
0 
0 
5 
0 

54 



0 
0 
0 
0 
0 
1 



1 
1 

10 

7 

1 

126 



62 138 110 20 330 Total 



All five nucleotide substitutions observed in humans caused amino acid substitutions, as did 
three of the S1X subsututions distinguishing human and chimpanzee. TT>e observed rate of 
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nonsynonymous substitution was substantially higher than is usually observed in human genes 

Nat Rev Genet 4:99-1 1 1, 2003). To test the hypothesis that an excess 
substations was present, we first analyzed the human and chimpanzee sequences by use of a 
McDonald-Kreitman test (Nature 35 1:652-654, 1991), as described in the "Methods" section This 
test showed [mat the excess of nonsynonymous substitutions observed in humans was not statistically 

substi ta tions m dnotdrffer S ignMcandy frome xpectationunderneutraUt y (P>10) TheVe 
nonsignificantresults maybe attributable to the lownumber of polymorphisms observed, which 
weakens the tests. Thus, although notable for being higher than in most genes, the bias toward 
nonsynonymous variants in our sample was not sufficient to reject the hypothesis of neutrality. 

h . ™ C 7™ *~ sealed that the human sample was dominated by two major 

baplot^es,^ dhsG>differingby ^e^^^^ ^ 

(hsG) status, actively (Drayna et al, Hum Genet 1 12:567-572, 2003; Kim et al Science 

fieauenc.es: 0.55 and 0.38. A variety of factors, including population subdivision and balancing 
natiual selection can lead to me presence of two or more mtermediate-fiequency haplo^pes in gene 
genealog.es (Marjoram & Donnelly, Genetics 136:673-683, 1994; Bamshad and Woodml^L 
^4:9 -111,2003, The evolution of two or more mtermeaiate-frequency clusters is L 
su^nsmglycommonunder selectively ^c.^^*^,^^^ 

To ^whemerpa tt ernsofD^^^ 
hypothesxs of evolutionary neutrality, we analyzed the sequences by use of the DT, DF and F 
statistics. These statistics are functions of the number of variable nucleotide positions in a sample of 
sequences the -an pairwise difference between sequences, and m^ 

1^11 1989 ; fi FUMldLl ' GenettaS 133:693 - 709 ' example, positive natural selection 
leadmg to the rapxd fixation of a single, advantageous haplotype will result in a decrease in the 
expected number of polymorphic sites, a decrease in the mean pairwise difference between 
sequences and an increase in the number of variants observed only once in the sample (Fu and Li 
^133:693-709, 1993, In contrast, balancing natural selection can lead to an inclsTm a U 
three of these values (Fu and Li, Genetics 133:693-709, 1993). 

TeSte0 '* CSeSta ^^ 
faxledtore J ectmehypomesisofneutraHtyinPTC(DT==1.55,P>. 05 ;DF = -146 P> 9 0 - F 
0.50, P > .60). However, several fines of evidence based on archaeology, genetics,'and linguistics 
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suggest that human populations have grown dramatically (>100-fold) over the past 100,000 years 
(Ruhlen, «77«e origin of language." Jolm Wiley and Sons, New York 1994; Klein, "He human 
career: human biological and cultural origins." University of Chicago Press, Chicago 1999; Stiner * 
al., Science 283:190-194, 1999; Excoffie, Curr Opto Genet Dev 12:675-682, 2002). Suchgrowth is 
known to have strong effects on genetic diversity (Rogers & Harpending, MolBiolEvol 9-552-569 
1 992). For example, diversity patterns in populations that have grown are often characterized by an 
excess of low-frequency genetic variants and a low mean pairwise difference between sequences 
both of which lead to reductions in the expected values of all three of the statistic we tested 
(Wooding & Rogers, Genetics 161:1641-1650,2002). Given evidence for population increase in the 
Upper Pleistocene and the possible effects of this growth on patterns of genetic diversity, the 
assumption of constant population size is likely inappropriate. 

To investigate the possibility that incorrect assumptions about population history were 
causing a type H statistical error (i.e., a failure to reject the null hypothesis of neutrality when it is 
false) m our initial tests, we devised new tests of the DT, DF, and F statistics that take population 
growth into account These tests were performed as described in the "Methods" section, under the 
assumption that human populations increased suddenly from an ancient effective population size of 
10,000 to a larger effective population size, Nl, t years before present, with a nucleotide-substitution 
rate of 10-9/site/year. These values are representative of those inferred for nuclear genes in humans 
(Tishkoff & Verrelli, Annu Rev Genomics Hum Genet 4:293-340, 2003). Because there is some 
disagreement about the timing and magnitude of this expansion (Hey, MolBiolEvol 14166-172 
1997; Fay and Wu, MolBiolEvol 16:1003-1005, 1999; Harris and Hey, Evol Anthropol 8:81-86' 
1999;HeyandHarris,M,/5 I o/^v 0 /16:1423-1426, 1999; Harpending and Rogers, ^ ' 
Genomics Hum Ge, l 1:361-385, 2000; Wall & Przeworski, Genetics 155:1865-1874, 2000; Excoffier 
^0^^2^12:67^ " 
iteratively tested the DT, DF, and F statistics for population histories with magnitudes of population 
growth from 1-fold to 1,000-fold and dates of population expansion from 0 to 200,000 years ago 

This procedure revealed that tests of all three statistics are highly sensitive to assumptions 
about population growth. For example, the assumption of 100-fold growth 100,000 years ago resulted 
m a change of DT's P value from .07 to .01. CIs generated for the DT statistic showed that the 
hypothesis of neutrality was rejected (at a two-tailed P value cutoff of .025) under all population 
histories in which the human population expanded between 15-fold and 1,000-fold between 10 000 
and 200 000 years ago. In addition, under the population history parameters for which observed DT 
values differed significantly from expectation, the values were greater than expected Thus, our data 
departed from expectation in a direction consistent with the hypothesis of balancing natural selection 
Tajnna, Genetics 123:585-595, 1989; Fu and Li, Genetics 133:693-709, 1993). Results were similar 
for CIs generated using the DF and F statistics, which rejected the hypothesis of neutrality for all 
population histories in which the human population expanded between 15-fold and 1,000-fold 
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between 30,000 and 200,000 years ago. Thus, the hypothesis of neutrality in PTC was rejected by 
these tests under all but the most conservative assumptions about population growth in humans. 

The sensitivity of the DT, DF, and F statistics to population growth has implications beyond 
the detection of natural selection in PTC. All three of these statistics are widely used in tests for 
natural selection in humans, usually under the assumption that human population sizes have remained 
constant (Tishkoff & Verrelh, Annu Rev Genomics Hum Genet 4:293-340, 2003). As we have 
shown, this assumption is highly conservative in the detection of balancing natural selection. 
However, the assumption of constant population size is anticonservative in the detection of positive 
natural selection, which leads to reductions in diversity nearly identical to those caused by population 
growth (Tajima, Genetics 123:585-595, 1989; Fu and Li, Genetics 133:693-709,). For this reason, 
tests for positive natural selection that use the DT, DF, and F statistics are vulnerable to type I 
statistical errors (i.e., the rejection of the null hypothesis when it is true) if human population 
increases are not taken into account With this in mind, the significance of many earlier tests of the 
DT, DF, and F statistics in humans, including our own (e.g., Wooding & Rogers, Hum Biol 72:693- 
695, 2000; Wooding et al, Am J Hum Genet 71 :528-542, 2002, may need to be reconsidered. 

Balancing selection is not the only force that can lead to significantly high DT, DF, and F 
values. Such patterns can also be caused by population subdivision, which allows the persistence of 
divergent haplotypes in different geographical regions (Kaplan et al, Genet Res 57:83-91, 1991; 
Hufconer^Mo/Sto/i^^^ ' 

Laporte & Charlesworth,. Genetics 162: 501-591, 2002). In our analyses, two sources of population 
subdivision were potentially important: subdivision between continents and subdivision within 
Africa. 

Population subdivision between continents is not large (Tishkoff & Verrelli, Annu Rev 
Genomics Hum Genet 4:293-340, 2003), but it could be sufficient to confound statistics like DT, DF 
andF. Totestthehypothesisthatmepre^ ' 
data is the result of subdivision between continents, we analyzed patterns of genetic differentiation 
among continental populations by use of the FST statistic, which compares the level of genetic 
diversity within subpopulations to levels of diversity in the population as a whole (Hartl and Clark, 
Sinauer Associates, Sunderland, MA, 1997). In our data, diversity patterns were driven largely by 
the frequencies of the hsA and hsG haplotypes, which were present at similar frequencies in most 
populations. The FST value observed among all four continental samples was 0.056. This value is 
significantly different from zero (P < .025) but is lower than is typically observed in nuclear genes 
which generally have values of 0.15 (Przeworski et al, Trends Genet 16:296-302, 2001; Schneider et 
al, Mech Ageing Dev 124:17-25, 2003; Tishkoff & Verrelli, Annu Rev Genomics Hum Genet 4:293- 
340, 2003; Watkins et al, Genome Res 13:1607-1618, 2003). This FST value is lower than 80% of 
those reported for 25,549 SNPs by Akey et al (Genome Res 12:1805-1814, 2002), for instance and 
is also lower than 45% of those reported for 1,627 genes by Schneider et al (Mech Ageing Dev 
124:17-25, 2003). The latter sample would be expected to have exceptionally low FST values 
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because * toctoded a large nun**, rfaMWl ftom ^ ^ ^ m 
A>n*,caus and ^pauic-Lahnos. Tb. FST obs«ved to „„ SMnpk suggests ^ 

^^-^' i »»«FSTva lm! wa resta ,ngl y a& ctt dby t hetocta irai „ f d leNonh 
^n. san*Ie owtog to ft. v«y Ugh fteqllHlcy olae hsA 

whofc. 0.025. Tb, value . stgnfflconny greater to „ (p < . 025) „„, „ 
^^<by Setose,.,. ( ^^^ IM:I7 . ^ w 

values reported by Akey er a/. {Genome Res 12:1805-1814, 2002). TTie strong effect of the North 
^ans^leeon J dbod„etoava ri «^„f^ re . p ^ < „ NorthAllCTtein 

The btsnny and geog^phy of hnntan genes," ft*^, ^ m 

Seoond, arohaeotogtoa. and hnguisuc eviderate suggest ft., North America « no. tohabhed by 
Inntnms^Urecenfl^jS.OOO^^^^ / 

-----332,.^,,^^^^^^^^- 

populations descended from a relatively small m. m i»,~r« j " * 

n • ^ araa aveiysmaU number of founders that entered the Americas via the 

E ^ fOT ^ iv « s ^^»°ta»laol^f.«nd^uto,Afiica( S cb>»id„ & 

!tl ? • te 16I:M " 74 ' 20o2) - ^ «■*»»"» 

oftthvunon wtthn. Africa codd inflate DT, DP, and F atodsues, yielding a u. stgnatore of 
batoncang nanna. se.cchcaa To tea, ft. hypofteais fta, aubdiviato. wiftto fte AMcan ft «. 

stodywa s „tofcaftehighobse^v«„.soffte.c stt « i s a cstoour ! ^ I e„, w ^J 

than for the sanrpto ., a whole, no, lower as wotod be expected if subdivision wiftta Afitoa wl 
solely explain the high DT values observed to PTC. ^ Mt 
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Table6: Results of Statistical Tests 



Sample 



Africa 

Asia 

Europe 

North America 
All 



Observed Value for 

Statist ic 

-A. 



P Value for 



.46 
2.94 
2.91 
-2.66 
1.55 



F 



-.89 -.56 

-.67 .57 

-62 .59 

-2.58 -3.05 

-1-46 -.50 



No Growth 



D T 



F 



.18 
.01 
.01 
.99 
.08 



.80 
.76 
.81 
.99 
.90 



.56 
.28 
.28 
.68 
.64 



Growth 



.01 
.01 
.01 
.99 
.01 



.03 
.01 
.01 
.91 
.01 



.01 
.01 
.01 
.87 
.01 



expanded 100-fold. assumption that, 100,000 years ago, the human population 

Taken together, three lines of evidence suggest that balancing natural selection has acted to 
evels of diversity inhumanpopulations. First, two haplotypes s.on g l y associated 
wr^onaUydtvergentphenotypcs d-^*.^ a^^.,^.,^^ 

sxgmficantly more intermediate-frequency variants than expected under neutrality (P < .01) Third 
' dishihutionof me taster and nontaster aUeles is not consistent wim the hypothesis' 
*at they have ansen through population subdivision within or between continents. Thus, K A 
Ftsher s hypothesis that balancing natural selection has maintained taster and nontaster alleles ' 
appears to hold true in humans (Fisher et al, Nature 144:7-50, 1939). 

Evidence for balancing selection at the PTC locus does not imply that other selective 

evoluhonofthenontaster allele, which was men manned by balancmg selection. This possibihty 
^ghtexplamme unusual 

gene, t ts also possible that specific PTC alleles have been favored by positive natural selection in 
particular envnonments, resulting in local adaptation. Such effects might account for the high 

frequency ofPTC taster a^les in New World populations and the significant low DT, DF, ^d F in 
our North American sample. ' 

™ em ~^th,o U ghwh^ 

alleles m human populations remains unclear. No stop codons or deletions, which might yield 

arepresen moursample, two account for >9 0 o/ 0 of observations: hsAandhsG. If nontaster alleges 

be expected (Hardmg et al., Am J Hum Genet 66:1351-1361, 2000). One possibility is that PTC 

bxtter toxms than homozygotes. We currently believe that PTC nontaster alleles may encode 
functional receptor molecules that bind to toxic bitter substances other than PTC. 
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Example 3: Identification ofSNPs in Other T2R Bitter Taste Receptor, 
Common allelic variants of a member of the TAS2R hitw ♦ ♦ 

bitt* recepto, we have sequenced 22 of fce 24 bo™ TAS^o. " "*" 

Umgconve D tiomlm e iood»,n^e r s„f (n eh mm „ T ,o, •■ . ecepBIS - 

See „so Table, 7A 7B J^T, *" ^ ^ " ^ f » *~ «"~ 

au ^ 1 2R genes contain common SNP's within tw • 

20 an average of 4.4 SNP's per T2R ««. ™ 8 S6qUenCe ' "* WC identified 

8 8 per T2R gene. Fifteen variants listed in dbSNP were not nflW n ♦ u 

polymorphic in our sample. However, many novel SNPs were identify 

WdesignauonmFigu,, Of^L^^ ^^^^ 
substitution in the encoded recent™ ■ ■ ■ *' 6 a ° ammo a< ? id 

25 U« geoe. Some of*. SNP's «,,. oK 1 . "* " 1 fa - fe ™ e S *"> "*» ta 

30 Example* Worldwide Coding Sequence Variation *. w 

tm , J eaue "<* Variation m Human Bitter Taste Receptors 

.gg™g.., aK228 .o«ijr^:^7 K " 4 - 2 ^- ta °»='<'~-. i o 
» . ff « B .f^ S e^ooor te z jrrr codto8iBpio ' wes - 
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genes. In addition, the mean FST value was significantly greater than expected, indicating that me 
high D values are attributable to differences between, not within, populations. Unlike the 
phenylthiocarbamide receptor, which shows evidence of strong balancing selection, other human 
bitter receptor genes appear to be influenced by local adaptation. It is proposed that these genes have 
adapted under natural selection imposed by toxic bitter substances produced by plants 

The human T2R38 (PTC) gene has been shown to exist in seven different allelic forms 
although only two of these, designated the major taster form and the major non-taster form, exist at 
high frequency outside of sub-Saharan Africa (Example 1 and 2; Wooding, et al, Am. J. Hum Genet. 
74:637-646, 2004). These two forms have been shown to be maintained by balancing natural 
selection, and it has been suggested that the non-taster form serves as a functional receptor for some 
other bitter substance not yet identified. T2R3 8 (PTC) studies suggest that there may be substantial 
additional complexity in the task of identifying specific ligands for each bitter taste receptor as 
different alleles of each gene may encode receptors that recognize different ligands. To facilitate the 
resolution of this problem, we here identify all of the common variants and haplotypes of the nearly 
complete bitter receptor gene repertoire in humans, and examine population genetic aspects of this 
variation worldwide. All of the SNPs in T2R43 (see Example 3 and Figure 1) were found in 
Cameroonian random individuals. Because parental genotypes were not available for these samples, 
the haplotypes for T2R43 were not determined in this study. 

20 Materials And Methods 

Population samples: Human genomic DNA was obtained from 55 unrelated individuals in 
5 different geographic populations including 21 Cameroonians, 10 Amerindians, 10 Japanese 9 
Hungarians, and 5 Pygmies. All DNA samples except Cameroonian were provided by Corieli Cell 
Repositories. 

PCR and DNA sequencing: We sequenced the open reading frame (ORF) of 21 out of 23 
human T2R genes and combined this information with data from the T2R38 (PTC) gene published 
previously (Wooding et al, Am. J. Hum. G^et. 74:637-646, 2004). Primers for PCR amplification 
and for sequencing were designed (using software at the Primer3 Web site) to amplify the entire ORF 
of each T2R gene in humans. 

PCR was performed in a total volume of 25 ul, containing 0.2 uM of each deoxynucleotide 
(Invitrogen), 15 pmol of each forward and reverse primers, 1.0 - 1.5 mM of MgC12, 10 mM Tris-HCl 
(PH8.3), 50 mM KC1, 0.75 U of Taa DNA polymerase (PE Biosystems), and 100 ng of genomic 
DNA. PCR conditions (PE9700, PE Biosystems) were as follows: 35 cycles of denaturation at 94 °C 
for 30 sec; annealing at 55 °C or 57'C, depending on the primers for 30 sec; and extension at 72 °C 
for 1 mm. The first step of denaturation and the last step of extension were 95°C for 2 min and 72 °C 
for 10 min, respectively. Five microliters of the PCR products were separated and visualized in a 2% 
agarose gel. Fifteen microliters of this PCRproduct were then treated with 0.3 U of shrimp alkaline 
phosphatase (USB) and 3 U of exonuclease I (USB) at 37»C for 1 hr, followed by incubation at 80»C 



25 



30 



35 
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<br 15 mm. This w diluted wim „ ^ vohmc of dHrO, and 6 jtl was used fo, ft. tod 
sequencmg reaction. Sequencing reacdons were performed „ bod, directions on ft. PCR producte in 
mactioo,cm,te^ 5pmo , ofprimer> , ,a ofBigDye TennimtorRe , dy dUC,SU ■ 

DmaAnmyslsr I^godisoq^osnbolweoopsfeofSr^asesdmssodusm. 
Lewo.^ disequmbnum staU!Sc , D . (Lewontfa) ^ ^ D* values were 

calculated usmg dm qou, Mflwra ^ (Abemis _ „ „, ^ 
Haplotype aud recombination • between SMP palm worn ralfalated ^ ^ * 

.65^13-2233, 2003). Tlus method uses 1 i te liuood-ba s ed algorithms to astimate a baseliue . of 
mcombtnation across a mgion, as we,, M reMve rates of recombhumon among SNP pah* 



"^^w OT c„ud„eW„^Ta jtaa -,Dsu,Usdo,„uic h cou lr ^sd« m ^ 
p™. dufeseuce betiveen rundmnly oboseu sequem*. „ , ^ ^ fc ^ „ 

bodi ^ aJlma * 123:585-595, 1989). We tested the hypothesis of selective neutrality^ under 

both the aasutnpuouofoonmmpoprna^^ 

yean i ,go,a S de S oribedbyWooding«u/.(^.y.a,, 1 . Ge ^. 74:6 3 7 . 646 2004) ' 

The diaMbution of T.jirua's D values in our sample was tested to determine whettter the 
^^-"^mcep^g^^^o^^^^ 
F^fornred by sunu^mg ,0,000 seat of 2. genes and comparing «. mean D value of eaob simulated 

lO^TTTT - 10 — **««">• ^^wasperformer, by random! y siting 
b n "** *-™ i '*-<-«*-f-» Thefcmtionofcompariuonsin 



tto,H82Corr tofem ^^^^ 

2>ev. 12.675.o82, 2002). Tms method compares m. obse^d value of FST with value, simulated by 
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whata : (ta mem FST m our da* was sig»ffioa„„ y grctor ^ ^ ^ FSJ ^ 

procedure was repeated 10 000 timpc u a «. - ^ p a * Lms 

y aicu iu,uuu tunes. The traction of comparisons in which th* pqt i * ^ 

Result and Discussion 

J" rT * 1 — «*• TAS2R) 8Mes to find ^ ^ ^ 

greatest number of these rare alleles, consistent with the v,v™tw a 

^d 25 cSNPs altos. «ly *c cSNP. to „, ^ » 
Table 7A: Bitter Taste Receptor Variants. 



GENE 


Total SNPs/ 
No. NONSY/ 
No. Haolotypes 1 


LIST OF 
HAPLOTYPE 
SEQUENCES 


NUMBER 
(N=ll(» 2 


TOTAL 
FREOUENCY 3 


SEQBD 
NO: 


T2R01 
T2R03 


3/2/3 


GC 
GT 
AC 


79 
5 

26 


0.718 
0.045 
0.236 


(NA/AAV 

47/48 
49/50 


T2R04 


i/1/2 


C 

T 


109 
1 


0.990 
0.010 


51/52 
53/54 




8/7/8 " 








55/56 
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Total SNPs/ 
No. NONSY/ 
GENE No. Haplotvpes 1 



LIST OF 
HAPLOTYPE 
SEQUENCES 



NUMBER 
(N=11Q) 2 



T2R05 



7/6/7 



T2R07 6/6/5 

(INCLUDING 1 
STOP) 



GATTCCG 


8 


GATTCCA 


12 


GATTCGG 


50 


GATACCG 


2 


GACTCCA 


34 


GACTTCA 


1 


GCCTCCA 


1 


AACTCCA 


2_ 


GCCAGG 


54 


GCCAAG 


4 


GCCGGG 


5 


GCTAGG 


1 


TCCAGG 


44 


TCCAGT 


1 


TTTAGG 


1 



T2R08 



6/5/6 



TGCACG 
TGCACA 
TGCATG 
TGTACA 
CTCTCG 



T2R09 



CTATA 
CTATG 
CTACA 
CTGTG 
CGATG 
TTATG 



7/7/8 



6/3/4 



T2R13 



T2R14 



1/1/2 



4/2/3 



CCTTGGC 
CCTTGGA 
CCTTGTC 
CCTCGGC 
CCTCAGC 
CCATGGC 
CATTGGC 
ACTTGG C 

TAT 
TAC 
TCT 
CAT 

A 
G 



98 

2 

3 

6 

1 



25 
80 
1 
2 
1 
1 



63 
2 
1 

35 
1 
3 
2 

_3_ 

98 
1 
1 

10 

75 
35 



TOTAL 
FREQUENCY 3 

0.073 
0.109 
0.455 
0.018 
0.309 
0.009 
0.009 
0.018 

0.491 
0.036 
0.045 
0.009 
0.400 
0.009 
0.009 



0.891 
0.018 
0.027 
0.055 
0.009 



0.227 
0.727 
0.009 
0.018 
0.009 
0.009 



0.573 
0.018 
0.009 
0.318 
0.009 
0.027 
0.018 
0.027 

0.891 
0.009 
0.009 
0.091 

0.682 
0.318 



SEQ ID 
NO: 
(NA/AAV 




73/74 
75/76 
77/78 
79/80 
81/82 
83/84 
85/86 



87/88 
89/90 
91/92 
93/94 
95/96 



97/98 
99/100 
101/102 
103/104 
105/106 
107/108 



109/110 
111/112 
113/114 
115/116 
117/118 
119/120 
121/122 
123/124 




133/134 

135/136 
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Total SNPs/ LIST OF " " 

^™ No.NONSY/ HAPLOTYPE NUMBER TOTAL, 
GENE No. HanlotvnPQ 1 QimTTi?7vr™?o ^ T TTTTj AU1AL 
gggg nto. napiorypes SEQUENCES fN=110) 2 FREOUENCY 3 


SEQID 

NO: 
(NA/AAV 


AA 107 0.973 
AG 1 0.009 

"raw 87471 — 2 2518 


137/138 
139/140 
141/142 


GCTA 67 0.609 
GCTG 28 0.255 
GCGA B 0.118 
GTTA l 0009 
ACTA 1 0 009 


143/144 
145/146 
147/148 
149/150 
151/152 


T2R38 5/5/7 " 

(PTC) CACCG 184 0.558 

CATCA 1 o.003 
GGCCA 1 o.003 
GACCA 10 0.030 
GACCG 7 0.021 
GACTA l o.003 
GATCA 126 0 382 


153/154 

155/156 

157/158 

159/160 

161/162 

163/164 | 

165/166 


T2R39 2/2/2 

CA 107 0.973 
■ TG 3 o 027 


167/168 
169/170 


T2R40 3/3/5 " 

CAG 90 0.818 
CAA 8 0.073 
CQ G l o.009 
AAG 9 o.082 
AGO 2 Q.Q18 


171/172 

173/174 
175/176 
177/178 
179/180 


T2R41 4/2/3 

CT 84 0.764 
CA 1 0.009 
TT 25 0 227 


181/182 
183/184 
185/186 


T2R44 11/9/7 ■ 

TAGCCTACG 27 0 245 
TAGGCTAGG 6 0055 
CAGCCTGGT 1 0 009 
CAGCCCGCG 28 0255 
CTGCCCGCG 45 0409 
CTGCTCGCG 1 0 .009 

CTACCCGCG 1 nm., 


187/188 
189/190 
191/192 
193/194 

IOC/1 Q/r 

197/198 
199/200 


T2R46 6/5/6 ~ — 

(INCLUDING 2 

STOPS) TAACC 22 0 .204 
TAAGC l o.009 
TAGCC 6 0.056 
"GCC 75 0.694 
TTGCT l o.009 
GAACC 3 o 02R 


201/202 
203/204 
205/206 
207/208 
209/210 
211/212 


T2R47 5/3/4 ~ 
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GENE 


Total SNPs/ 
No. NONSY/ 
No. Haplotypes 1 


LIST OF 
HAPLOTYPE 
SEQUENCES 


NUMBER 
(N=110) 2 


TOTAL 
FREQUENCY 3 


SEQID 

NO: 
(NA/AA) 4 






AAG 


32 


0.291 


213/214 






AAT 


76 


0.691 


215/216 






AGG 


1 


0.009 


217/218 






GAG 


1 


0.009 


219/220 


T2R48 


12/9/9 














GCAAATGCT 


33 


0.300 


221/222 






GCAAATGCC 


53 


0.482 


223/224 






GCAAATGTC 


3 


0.027 


225/226 






GCAAATCCT 


1 


0.009 


227/228 






GCCAATGCC 


5 


0.045 


229/230 






GAAAATGTC 


1 


0.009 


231/232 






ACAAATGCC 


12 


0.109 


233/234 






ACAAACGCC 


1 


0.009 


235/236 






ACATGTGCC 


1 


0.009 


237/238 


T2R49 


11/9/7 














AGCCGATGA 


37 


0.378 


239/240 






AGCCGATGG 


2 


0.020 


241/242 






AGCCAATGA 


1 


0.010 


243/244 






AGCAGATGA 


3 


0.031 


245/246 






AGAAGGCTA 


19 


0.194 


247/248 






AAAAGGCTA 


13 


0.133 


249/250 






GGCCGATGA 


23 


0.235 


251/252 


T2R50 


7/3/4 














CGG 


77 


0.700 


253/254 






CGA 


30 


0.273 


255/256 






CTG 


1 


0.009 


257/258 






TGA 


2 


0.018 


259/260 


T2R60 


2/1/2 














A 


92 


0.958 


261/262 






T 


4 


0.042 


263/264 




127/95/109 












(INCLUDING 3 
STOP CODON) 











1: NONSY -Nonsynonymous Substitution 



2: N— Number examined. N does not apply to T2R38, T2R46, T2R49, T2R60. N of T2R38 = 330; T2R46 « 

108; N of T2R49 = 98; N of T2R60 = 96. 
3: Total Frequency =» Total FRE of 5 Populations 
5 4: SEQ LD NO: (NA/AA) - Sequence ID number for Nucleic Acid / Amino Acid. The reference, previously 

known sequence is indicated in bold; corresponding GenBank Accesion numbers are listed in Table 7C. 



Table 7B: Haplotype Distribution in Various Populations 



GENE 


LIST OF 
HAPLOTYPES 


CAM 1 AME 1 JAP 1 HUN 1 


PYG 1 


Number 2 / 

Total 
Frequency 


SEQID 

NO: 
(NA/AA) 3 


T2R01 














GC 


39 6 12 14 


8 


79/ 0.718 


47/48 




GT 


0 0 12 


2 


5 / 0.045 


49/50 




AC 


3 14 7 2 


0 


26/ 0.236 


51/52 


T2R03 
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LIST OF 
GEJNE HAPLOTYPES 

C 
T 



T2R04 



T2R05 



T2R07 



GATTCCG 
GATTCCA 
GATTCGG 
GATACCG 
GACTCCA 
GACTTCA 
GCCTCCA 
AACTCCA 



GCCAGG 
GCCAAG 
GCCGGG 
GCTAGG 
TCCAGG 
TCCAGT 
TTTAGG 



TGCACG 
TGCACA 
TGCATG 
TGTACA 
CTCTCG 



T2R08 



T2R09 



CTATA 
CTATG 
CTACA 
CTGTG 
CGATG 
TTATG 



CCTTGGC 
CCTTGGA 
CCTTGTC 
CCTCGGC 
CCTCAGC 
CCATGGC 
CATTGGC 
ACTTGGC 




19 
20 
1 


0 

20 


0 
19 


0 
17 


6 
4 


25/ 
80/ 


0.227 
0.727 


97/98 
99/100 


2 


0 


0 


0 


0 


1/ 


0.009 


101/102 


0 
0 


0 
0 
0 


0 
0 

1 


0 
1 
0 


0 
0 
0 


2/ 
1/ 
1/ 


0.018 
0.009 
0.009 I 


103/104 
105/106 
107/108 



T2R10 



109/110 
111/112 
113/114 
115/116 
117/118 
119/120 
121/122 
123/124 
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LIST OF 
GENE HAPLOTYPES 


Number 2 / 

CAM 1 AME 1 JAP 1 HUN 1 PYG 1 Fr^SL 


1 SEQID 

NO: 
I (NA/AA) 3 




TAT 
TAC 
TCT 
CAT 


31 20 20 18 9 98/ 0 891 

0 0 0 0 1 1/ o.009 

1 0 0 0 0 1/ 0.009 
— — 0 0 0 0 10/ Q.Q91 


1 125/126 
127/128 
129/130 
131/132 




T2R13 

A 
G 


40 10 5 10 10 75/ 0.682 
? *0 l^ 8 0 35/ 0.318 


133/134 
135/136 




T2R14 

AA 
AG 
GA 


40 20 19 18 10 107/ 0.973 
J 0 1 0 0 1/ 0.009 
^ 0 0 0 0 21 OPTS 


137/138 
139/140 
141/142 




T2R16 

GCTA 
GCTG 
GCGA 
GTTA 
ACTA 


23 16 11 9 8 67/ 0.609 
£ 4 9 9 0 28/ 0.255 
12 0 0 0 1 13/ 0.118 
0 0 0 0 1 1/ 0.009 
1 0 0 0 0 1/ 0.0Q9 


143/144 
145/146 
147/148 
149/150 
151/152 




T2R38 

(PTC) CACCG 
CATCA 
GGCCA 
GACCA 
GACCG 
GACTA 
GATCA 


N= 330 
184/ 0.558 
1 / 0.003 
1 / 0.003 
10 / 0.030 
7 / 0.021 
1 / 0.003 

. 126/ 0.382 


153/154 
155/156 
157/158 
159/160 
161/162 
163/164 
165/166 




T2R39 

CA 

TG 


39 20 20 18 10 107/ 0.973 
2 2 0 0 0 3/ 0.09.7 


167/168 
169/170 


T2R40 

CAA 
CGG 
AAG 
AGG 


33 12 20 16 9 90/ 0.818 
0 8 0 0 0 8/ 0.073 
\ 0 0 0 0 1/ 0.009 
6 0 0 2 1 9/ 0082 
— ± °- 0 0 0 21 0.018 


171/172 
173/174 
175/176 

1 '7*7/1 no 
l / 111 /o 

179/180 




T2R41 

CT 
CA 
TT 


41 13 11 9 10 84 / 0.764 
\ 0 0 0 0 1/ 0.009 
— 7 9 9 0 25/ 0.227 


181/182 
183/184 
185/186 


T2R44 

TAGCCTACG 


6 10 4 7 0 27/ 0.245 


187/188 
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LIST OF 
GENE HAPLOTYPES 


Number 1 / 

i * , Total 
CAM 1 AME JAP HUN 1 PYG 1 Frequency 


SEQED 
NO: 
(NA/AA) 3 


TAGGCTAGG 
CAGCCTGCT 
CAGCCCGCG 
CTGCCCGCG 
CTGCTCGCG 
CTACCCGCG 


1 0 0 5 0 6/ 0.055 

1 0 0 0 0 1/ 0.009 
0 10 14 4 0 28/ 0.255 
32 0 2 2 9 45/ 0.409 
0 0 0 0 1 1/ 0.009 

2 0 0 0 0 2/ 0.018 


189/190 
191/192 
193/194 
195/196 
197/198 
199/200 


T2R46 

TAACC 
TAAGC 
TAGCC 
TTGCC 
TTGCT 
GAACC 


N= 108 

3 10 3 6 0 22/ 0.204 

0 0 1 0 0 1/ 0.009 

1 0 0 5 0 6/ 0.056 
34 10 16 5 10 75/ 0.694 

1 0 0 0 0 1/ 0.009 
3 0 0 0 0 3/ 0.028 


201/202 
203/204 
205/206 
207/208 
209/210 
211/212 


T2R47 

AAG 
AAT 
AGG 
GAG 


7 1° 4 11 0 32/ 0.291 
34 10 16 6 10 76 / 0.691 
1 0 0 0 0 1/ 0.009 
0 0 0 1 0 1/ 0.009 


213/214 
215/216 
217/218 
219/220 


T2R48 

GCAAATGCT 
GCAAATGCC 
GCAAATGTC 
GCAAATCCT 
GCCAATGCC 
GAAAATGTC 
ACAAATGCC 
ACAAACGCC 
ACATGTGCC 


7 1° 4 12 0 33 / 0.300 
21 6 15 6 5 53 / 0.482 

0 0 0 0 3 3/ 0.027 

1 0 0 0 0 1/ 0.009 
0 4 1 0 0 5/ 0.045 

0 0 0 0 1 1/ 0.009 
11 0 0 0 1 12/ 0.109 

1 0 0 0 0 1/ 0.009 
1 0 0 0 0 1/ 0.009 


221/222 
223/224 
225/226 
227/228 
229/230 
231/232 
233/234 
235/236 
237/238 


T2R49 

AGCCGATGA 
AGCCGATGG 
AGCCAATGA 
AGCAGATGA 
AGAAGGCTA 
AAAAGGCTA 
GGCCGATGA 


N= 98 

23 0 0 6 8 37/ 0.378 

2 0 0 0 0 2/ 0.020 
1 0 o 0 0 1 / 0.010 

3 0 0 0 0 3 / 0.031 
1 6 9 3 0 19/ 0.194 

1 4 7 1 A 14. / n loo 
^ ' * U 13 / 0.133 

3 1° 4 6 0 23 / 0.235 


239/240 
241/242 
243/244 
245/246 
247/248 
249/250 
251/252 


T2R50 

CGG 
CGA 
CTG 
TGA 


40 10 4 13 10 77/ 0.700 
1 10 14 5 0 30/ 0.273 
1 0 0 0 0 1/ 0.009 
0 0 2 0 0 2/ 0.018 


253/254 
255/256 
257/258 
259/260 


T2R60 


N= 96 
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LIST OF 
GENE HAPLOTYPES 


Number 2 / 

, , Total 
CAM 1 AME JAP 1 HUN 1 PYG 1 Freauenrv 


SEQID 
NO: 
(NA/AA) 3 


A 
T 

NOTES: 


27 20 20 18 7 92 / 0.958 
3 0 0 0 1 4/ 0.042 


261/262 
263/264 



1 : CAM = Cameroonian; AME -Amerindian; JAP = Japanese; HUN - Hungarian; PYG = Pygmy 

4- OTOTO»S^NAMA\ 2 ^ °Sr^t iDdiCated) - T ° tal Frequen °y - Toti of simulations 
4. SBQ ID NO: (NA/AA) - Sequence ID number for Nucleic Acid / Amino Acid. 



Gene 


GenBank Accession 


GenBank Allele 


■vcAerence sequence 

GenBank Sequence 
SEQ ID NOs.(NA/AAV 


NA 
position 


NA Variants 


AA position 


AA Variants 


T2R01 

! 332 
616 


AF227129 

G/A 
C/T 


GC 
111 

206 


47/48 

Arg/His 
Arg/Trp 


1ZKU3 

349 


AF227130 

C/T 


C 

117 


53/54 
Pro/Ser 


T2R04 
8 
17 
20 
186 
221 
268 
512 


AF227131 

G/A 
A/C 
T/C 
T/A 
C/T 
C/G 
G/A 


GATTCGG 

3 
6 
7 

62 
74 
96 
171 


61/62 

Arg/GIn 
Tyr/Ser 
Phe/Ser 
Phe/Leu 
Thr/Met 
Leu/Val 
Ser/Asn 


T2R05 

77 
235 
338 
500 
638 
881 


AF227132 

G/T 
C/T 
C/T 
A/G 
G/A 
G/T 


GCCAGG 

zo 

79 
113 
167 
213 
294 


73/74 
Ser/Ile 
Arg/Cys 
Pro/Leu 
Tyr/Cys 
Arg/GIn' 
Arg/Leu 


T2R07 

254 
538 
640 
787 
788 
912 


AF227133 

T/C 
G/T 
C/T 
A/T 
C/T 
G/A 


TGCACG 

85 
180 
214 
263 
263 
304 


87/88 
He/Thr 
Ala/Ser 
Arg/Stop 
Thr/Ser 
Thr/Met 
Met/lie 


T2R08 

142 
370 
496 
829 


AF227134 

C/T 
T/G 
A/G 
T/C 


CTATA 

48 
124 
166 

277 


97/98 
Leu/Phe 
Trp/Gly 
Arg/GIy 
Tyr/His 
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! Gene 


GenBank Accession 


GenBank Allele 


GenBank Sequence 
SEO ED NOs.(NA/AAV 


! NA 
position 


NA Variants 


AA position 


AA Variants 


922 


A/G 


308 


Met/Val 


T2R09 

201 
381 
450 
[ ->OU 

i 697 
1 867 
| ooU 


AF227135 

C/A 
C/A 
T/A 
T/C 
G/A 
G/T 
C/A 


CCTTGGC 
67 
127 
150 
187 
233 
289 
294 


109/110 

Phe/Leu 
Asn/Lys 
Asp/Glu 
Val/Ala 
Ala/Thr 
Leu/Phe 
Leu/Met 


T2R10 

! 467 

I J/1 

691 


AF227136 

T/C 
A/C 
T/C 


CAT 
156 
174 
231 


131/132 

Met/Thr 
Lys/Thr 
Ser/Pro 


T2R13 

1 776 


AF227137 

A/G 


A 

259 


133/134 

Asn/Ser 


IZK14 

| 256 
589 


AF227138 

A/G 
A/G 


AA 

86 
197 


137/138 

Thr/Ala 
Met/Val 


T2R16 
301 
! 481 

DlO 

665 


AF227139 

G/A 
C/T 
T/G 
A/G 


GCTG 
101 
161 
172 
222 


145/146 

Val/Met 
Pro/Ser 
Asn/Lys 
His/Arg 


T2R38 

I 145 
239 
| 785 
1 820 
886 


AF494231 

C/G 
A/G 
C/T 
C/T 
G/A 


GATCA 

49 

80 
262 
274 
296 


165/166 

Pro/Ala 
His/Arg 
Ala/Val 
Arg/Cys 
Val/Ile 


T2R39 

578 
589 


AF494230 

C/T 
A/G 


CA 
193 
197 


167/168 

Ser/Phe 
Lys/Glu 


T2R40 

560 
817 
871 


AF494229 

C/A 
A/G 
G/A 


CAG 
187 
273 
291 


171/172 

Ser/Tyr 
Thr/Ala 
Gly/Ser 


T2R41 

380 


AF494232 

C/T 


TT 

127 


185/186 
Pro/Leu 
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Gene 


GenBank AccMrfm. 


_ GenBank Allele 


GenBank Sequence 
_ SEOIDNOs nVA/AA) 1 


NA 
position 


NA Variants 


AA position 


AA Variants 


[ 584 


T/A 


195 


Val/Asp 


kZRA't 
103 


AF494228 

T/C 


CAGCCCGCG 
35 
162 
200 
217 
219 
227 
240 
276 
281 


193/194 


484 
599 
649 
656 
680 

71 Q 

/ lo 
827 
843 


A/T 
G/A 
C/G 
C/T 
T/C 
A/G 
C/G 
G/T 


Trp/Arg 

Met/Leu 

Cys/Tyr 

Gln/Glu 

Pro/Leu 

Val/Ala 

Ile/Val 

Pro/Arg 

Trp/Cys 


T2R46 
106 
682 
749 
834 
862 


AF494227 

T/G 
A/T 
A/G 
C/G 
C/T 


1TGCC 

36 
228 
250 
278 
288 


207/208 
Phe/Val 
Met/Leu 
Stop/Tip 
He/Met 
Gin/Stop 


T2R47 

521 
577 
756 


AF494233 

A/G 
A/G 
G/T 


AAT 
174 
193 
252 


215/216 
His/Arg 
Ile/Val 
Leu/Phe 



T2R48 
94 
113 
376 
456 
673 
719 
799 
815 
895 



T2R49 

235 
421 
429 
442 
516 
706 
755 
764 
808 



AF494234 

G/A 
C/A 
A/C 
A/T 
A/G 
T/C 
G/C 
C/T 
T/C 



GCAAATGCC 

32 

38 
126 
152 
225 
240 
267 
272 
299 



AF494236 

A/G 

G/A 

C/A 

C/A 

G/A 

A/G 

T/C 

G/T 

A/G 



AGCCGATGA 

79 

141 

143 

148 

172 
236 
252 
255 
270 



223/224 
Val/Ile 
Thr/Lys 
Lys/Gln 
Arg/Ser 
Ile/Val 
Ile/Thr 
Val/Leu 
Pro/Leu 
Cys/Arg 



239/240 

Lys/Glu 
Val/Ile 
His/Gin 
His/Asn 
Met/He 
Ile/Val 
Phe/Ser 
Arg/Leu 
Ile/Val 
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T2R50 
155 
181 
608 


AF494235 
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G/A 


CGA 

52 

61 
203 


255/256 
Ala/Val 
Ala/Ser 
Cys/Tyr 


595 AY aT 4 !9 9 261/262 
1 : SEQ ID NO: (NA/AA) - Sequence ID number for Nucleic Acid / Amino ^StL™ 
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Of the cSNPs found, 92(72%) are nonsynonymous and 32(25%) are synonymous. 42.5% of 
nonsynonymous substitntions are non-conservative, and amino acid changes were observed across the 
entire coding region of these genes. Interestingly, we observed two segregating pseudogenes (SPGs); 
that is, T2R genes for which both intact and nonsense versions (null alleles) were segregating in the ' 
human sample. One is the T2R46 gene, which has two nonsense alleles, G749A and C862T. G749A 
has a null allele frequency of ~ 24% in all populations except Pygmy, while C862T was observed 
only once, in Cameroonians. The other SPG was observed in T2R7 and although it was observed 
only in Amerindians, the null allele displayed a frequency of 30% in this population. Although only 
two SPGs seem to exist in the entire repertoire of T2R genes, our result is consistent with 
observations in the olfactory receptor gene family, and suggests that T2R SPGs exist at different 

fre q uenciesmdifferentpopulations(Menashe e ra/., J H« WI .M 0 /. Genet. 11:1381-1390, 2002; Gilad, et 
al., Mol Biol. Evol. 20:307-314, 2002). 

Given the feet that different combinations of each polymorphic site in a bitter receptor gene 
can explain the phenotypic variation for a bitter compound in humans (Kim, et al, Science 299:1221- 
1225, 2003), we inferred possible functional haplotypes with nonsynonymous substitutions of each 
gene using Bayesian methods as implemented in the Phase package. In the 20 T2R genes that contain 
at least two cSNPs specifying amino acid change, we identified 109 different haplotypes. The 
number of haplotypes ranged from 2 to 9 for each gene. With the exception of T2R44 and T2R49, 
the most common haplotype of each gene was consistent across populations, and was observed in 111 
five different populations. 

To investigate the effects of natural selection on the bitter taste receptor genes as a group, 
we performed neutrality tests using Tajima's D statistic. Tajima's D compares the mean pairwise' 
difference between randomly chosen sequences in a sample with the number of segregating sites 
(Tajima, Genetics 123:585-595, 1989). Tajima's D test is usually performed under the assumption 
that population size has been constant. However, much evidence suggests that modern human 
populations are the product of an expansion that took place around 100,000 years ago, during which 
the human population soared from an initial effective size of roughly 10,000 (Excoffier, Curr Opin 
Gmet. Dev. 12:675-682, 2002; Klein, 77* Human Career: Human Biological and Cultural Origins 
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1999-^^, etal.,Annu. Rev. Genomics Hum. Genet. 1:361-385, 2000; Tishkoff; etal Annu 
Rev. Genomics Hum. Genet. 4:293-340,2003). Such growth is important to consider in neutrality ' 

ests because it can both mimic the effects of positive selection (, ,, selective sweeps) and obscure 
the effects of balancing selection (e.g. heterozygote advantage) (Bamshad, et al, Nat. Rev Genet 
4:99-1 1 1, 2003). For this reason, we tested the hypothesis of selective neutrality under both the 

^^rT* POPUlati0n ^ aSSUm P tionofl00 -^8rowth,100,000year S a g o,as 

described by Wooding et al. (Am. J. Hum. Genet. 74:637-646, 2004). 

When considered individuaUy, only two bitter taste genes, T2R1 6 and T2R49 showed 
sxgnificant departures from neutrality under both the assumption of constant population size and the 
assumpuon of population growth (Table 8). T2R16 had a highly negative D-value (-2 152 P < 0 01 
under growth) and T2R49 had a highly positive D-value (1 .856, P < 0.005 under growth) One ' 
additional gene, T2R13, showed a significant positive departure from expectation only under the 
assumption of population growth (p < 0.025). Three more T2R4, T2R44, and T2R60 had D values 
that we re high but not statistically significant under either model. When multiple comparisons were 
taken mto account using a Bonferroni correction, none of the D values departed significantly from 
expectation, although under growth the D value for T2R49 failed to reach significance only 
marginally. *" 3 



Table 8: Summary statistics for T2Rs 

T2R* Seg. „p d „ (%) p „ „ 

> 3 0.047 0.005 0.233 -1.618 ^? „,„ 

as as as as : I 

as as as ss 3 g 

I l as as as sjs £ : 

S 5S g r » g 
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Although none of the individual D values in our sample departed significantly from 
expectation, the distribution of D values did. When we compared the mean D value observed in the 
21 bitter taste receptor genes with theoretical expectations under the assumptions of (i) no population 
growth, and (ii) 100-fold growth, 100,000 years ago, the mean D value observed in our sample (- 
0.35) differed significantly from expectation under growth (-0.67) (P < 0.01). Further, bootstrap 
resampling tests showed that the mean observed D value was significantly greater than the mean D 
value (-1.00) m 160 genes resequenced as part of the NIH Environmental Genome Project (EGP) 
The EGP genes provide an appropriate comparison because they, like the taste receptors in our 
sample, are thought to be particularly important in mediating interactions between the human body 
and its environment (Wakefield, Environ. Health Perspect 1 10*757-759, 2002). As shown in 
F,gure 3, the EGP genes showed a distribution of D values similar to that expected under the 
assumption of 100-fold growth, 100,000 years ago. Thus, not only is the average D value in the bitter 
taste receptors significantly greater than expected under reasonable assumptions about human 
population history (U., growth), it is significantly greater than expected given a large sample of 
comparable genes (i.e., environmentaUy responsive genes). 

High values of Tajima's D are caused by a relative overabundance of SNPs with 
mtennediate frequencies (i.e., frequencies near 5 0 o/o). Such values often mdicate ^ presence q{ 
balancing natural selection; however, high D values can also be caused by population subdivision 

' etQl - ^ Ge " et 4: "- m ' 20 ° 3 >- To distinguish these alternatives we analyzed the 
"""^ ^ ^ V3lueS *** »™ m «» absence of population differentiation and values 
near unxty in the presence of extreme differentiation. FST values in our sample ranged from 0.02 to 
0.47 with a mean of 0.25. Published FST values based on DNA sequence variation in humans are 
usually lower, often falling around 0.15 (Tishkoffe, a/., ^„ M .^. Genomics Hum. Gene, 4293- 
340, 2003). For example, in a study of 25,549 SNPs, Akey etal. found an average FST of 0 123 
(GenomeRes. 12:1805-1814,2002). Bootstrap tests showed that the mean value observed in our 
dataset is significantly higher than that in the TSC dataset. The preponderance of high FST values in 
our sample suggests that human populations differ more with respect to variation in the bitter taste 
receptor genes than they do with respect to most other regions of the genome. 

The patterns of genetic variation found in the bitter taste receptor genes in general are 
ulustrated by the T2R49 gene. Among the genes we examined, T2R49 had the highest value of 
Tajima's D (1.86) and the third highest FST (0.33). The value of Tajima's D in T2R49 is 
significantly greater expected (P< 0.05 under constant population size, P< 0.01 undergrowth). 
The FST value rs significantly greater than zero (P < 0.01) and exceeds more than 8 5 o/ 0 of the FST 

Va 7 r T,to d . byAkey " 0/ - {Gen ° meReS - 12:1 *0 5 - 18 H*)02). Aminimumspanningtree 
relating T2R49 haplotypes showed that two common, distmct haplotype clusters differing from each 
other by s« nucleotide substitutions (including four amino acid substitutions) were present The 
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presence of two common, but distinct, clusters causes a high value of Tajima's D at this locus. As 
shown in Figure 4, each cluster was common in a different geographical region. While cluster 1 
accounted for 91.5% of observations in Africa and Europe, cluster 2 accounted for 65.0% of 
observations in Asia and Amerindians. The high frequency of each cluster in a different geographical 
region causes a high FST value at this locus. Further, nine out of 1 1 nucleotide substitutions in 
T2R49 cause amino acid substitutions. This ratio is extremely high compared to that observed in 
most genes, and is consistent with the hypothesis that positive natural selection has been active in the 
region (McDonald, etal, Nature 351:652-654, 1991; Nekrutenko, era/., GenomeRes. 12:198-202, 
2002). 



Example 5: Detecting Single Nucleotide Alterations 

T2R bitter taste receptor single nucleotide alterations, whether categorized as SNPs or new 
mutations can be detected by a variety of techniques in addition to merely sequencing the target 
sequence. Constitutional single nucleotide alterations can arise either from new germline mutations, 
15 or can be inherited from a parent who possesses a SNP or mutation in their own germline DNA. The 
techniques used in evaluating either somatic or germline single nucleotide alterations include 
hybridization using allele specific oligonucleotides (ASOs) (Wallace et ah, CSHL Symp. Quant. 
Biol. 51:257-261, 1986; Stoneking et ai, Am. J. Hum. Genet. 48:370-382, 1991), direct DNA 
sequencing (Church and Gilbert, Proc. Natl. Acad. Sci. USA 81:1991-1995, 1988), the use of 
20 restriction enzymes (Flavell et al, Cell 15:25, 1978; Geever et al, 1981), discrimination on the basis 
of electrophoretic mobility in gels with denaturing reagent (Myers and Maniatis, Cold Spring Harbor 
Symp. Quant. Biol. 51:275-284, 1986), RNase protection (Myers et al, Science 230:1242, 1985), 
chemical cleavage (Cotton etal, Proc. Natl. Acad. Sci. USA 85:4397-4401, 1985), and the ligase- 
mediated detection procedure (Landegrene/ al, Science 241:1077, 1988). 
25 Allele-specific oligonucleotide hybridization (ASOH) involves hybridization of probes to 

the sequence, stringent washing, and signal detection. Other new methods include techniques that 
incorporate more robust scoring of hybridization. Examples of these procedures include the ligation 
chain reaction (ASOH plus selective ligation and amplification), as disclosed in Wu and Wallace 
(Genomics 4:560-569, 1989); mini-sequencing (ASOH plus a single base extension) as discussed in 
Syvanen (Meth. Mol. Biol 98:291-298, 1998); and the use of DNA chips (miniaturized ASOH with 
multiple oligonucleotide arrays) as disclosed in Lipshutz etal (BioTechniques 19:442-447, 1995). 
Alternatively, ASOH with single- or dual-labeled probes can be merged with PCR, as in the 5'- 
exonuclease assay (Heid et al, Genome Res. 6:986-994, 1996), or with molecular beacons (as in 
Tyagi and Kramer, Nat. Biotechnol 14:303-308, 1996). 
35 Another technique is dynamic allele-specific hybridization (DASH), which involves 

dynamic heating and coincident monitoring of DNA denaturation, as disclosed by Howell et al. (Nat. 
Biotech. 17:87-88, 1999). A target sequence is amplified by PCR in which one primer is 
biotinylated. The biotinylated product strand is bound to a streptavidin-coated microtiter plate well, 
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and fee non-biotinylated strand is rinsed away with alkali wash solution. An oligonucleotide probe, 
specific for one allele, is hybridized to the target at low temperature. This probe forms a duplex DNA 
region that interacts with a double strand-specific intercalating dye. When subsequently excited, the 
dye emits fluorescence proportional to the amount of double-stranded DNA (probe-target duplex) 
present. The sample is men steadily heated while fluorescence is continually monitored. A rapid fell 
in fluorescence indicates fee denaturing temperature of fee probe-target duplex. Using this technique, 
a single-base mismatch between fee probe and target results in a significant lowering of melting 
temperature (J^ feat can be readily detected. 

Oligonucleotides specific to normal or allelic sequences can be chemically synthesized using 
commercially available machines. These oligonucleotides can then be labeled radioactively wife 
isotopes (such as 32 P) or non-radioactively, wife tags such as biotin (Ward and Langer et al, Proc. 
Natl. Acad. Sci. USA 78:6633-6657, 1981), and hybridized to individual DNA samples immobilized 
on membranes or other solid supports by dot-blot or transfer from gels after electrophoresis. These 
specific sequences are visualized by methods such as autoradiography or fluorometric (Landegren et 
15 al, Science 242:229-237, 1989) or colorimetric reactions (Gebeyehu et al, Nucleic Acids Res. 

15:4513-4534, 1987). Using an ASO specific for a normal allele, fee absence of hybridization would 
indicate a mutation in fee particular region of fee gene, or a deleted gene. In contrast, if an ASO 
specific for a mutant allele hybridizes to a sample then feat would indicate fee presence of a mutation 
in fee region defined by fee ASO. 
20 A variety of other techniques can be used to detect fee mutations or other variations in DNA. 

Merely by way of example, see U.S. Patents No. 4,666,828; 4,801,531; 5,110,920; 5,268,267; 
5,387,506; 5,691,153; 5,698,339; 5,736,330; 5,834,200; 5,922,542; and 5,998,137 for such methods. 
Additional methods include fluorescence polarization methods such as those developed by Pui Kwok 
and colleagues (see, e.g., Kwok,Hum. Mutat, 19(4):315-23, 2002), microbead methods such as those 
developed by Mark Chee at Illumina (see, e.g., Oliphant et al, Biotechniques. 2002 Jun;Suppl:56-8, 
60-61, Shen et al, Genet Eng. News, 23(6), 2003), and mass spectrophotometery methods such as 
those being developed at Sequenom (on fee Web at sequenom.com) (see, e.g., Jurinke et al, Methods 
MolBiol. 187:179-92, 2002; Amexis etal, Proc NatlAcadSci USA 98(21):12097-102, 2001; Jurinke 
et al, Adv Biochem Eng Biotechnol. 2002;77:57-74; Storm et al, Meth. Mol. Biol, 212:241 262, 
30 2002; Rodi et al, BioTechniques., 32:S62 S69, 2002); USPN 6,300,076; and WO9820166). 

Example 6: Differentiation of Individuals Homozygous versus Heterozygous for Mutation(s) 

Since it is believed feat fee haplotype of any taste receptor can influence fee perception of 
taste by a subject, it may sometimes be beneficial to determine whether a subject is homozygous or 
35 heterozygous for SNPs within any one or more of fee T2R bitter taste receptors described herein. 

By way of example, fee oligonucleotide ligation assay (OLA), as described atNickerson et 
al (Proc. Natl Acad. Sci. USA 87:8923-8927, 1990), allows fee differentiation between individuals 
who are homozygous versus heterozygous for alleles or SNPs indicated in Figure 1 or Table 7. This 
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feature allows one to rapidly and easily determine whether an individual is homozygous for at least 
one taste receptor variant, which condition believed to influence taste perception, particularly bitter 
taste reception, in the individual. Alternatively, OLA can be used to determine whether a subject is 
homozygous for either of these mutations. 

As an example of the OLA assay, when carried out in microtiter plates, one well is used for 
the detennmation of the presence of the T2R bitter taste allele in the T2R1 gene that contains an A at 
nucleotide position 332 (numbering from SEQ ID NO: 1) and a second well is used for the 
determination of the presence of the T2Rbitter taste allele in the same gene that contains a G at that 
nucleotide position in the alternate allele sequence. Thus, the results for an individual who is 
heterozygous for the mutation will show a signal in each of the A and G wells. 

Example 7: Bitter Taste Profiles 

W ^P™isionherefoofspecm c SNPs^ 
hnked to bxtter taste sensitivity to one or more bitter compounds, as well as SNPs and haplotypes that 
can be used to distinguish populations from each other, genetic profiles that provide information on 
the buter taste perception and/or regionality of a subject are now enabled. Such profiles are useful in 
mynad applications, including for instance selecting subjects for inclusion in (or exclusion from) a 
protocol (such as a taste test), 

Bitter taste-related genetic profiles comprise the distinct and identifiable pattern of alleles or 
haplotypes, or sets of of alleles or haplotypes, of the SNPs in bitter taste receptor molecules identified 
herem. The set of bitter taste receptors analyzed in a particular profile will usually include at least 
one of the following: T2R1, T2R3, T2R4, T2R5, T2R7, T2R8, T2R9, T2R10, T2R13 T2R14 
T2R16, T2R38, T2R39, T2R40, T2R41, T2R43, T2R44, T2R45, T2R46, T2R47, T2R48, T2R49 
T2R50,orT2R60. 

By way of example, any subset of the molecules listed in Figure 1 or Table 7 (or 
corresponding to the molecules in these lists) may be included in a single bitter taste profile. Specific 
examples of such subsets include those molecules that show a SNP that introduces a stop codon (e S 
the variant of T2R44 at position 843; the variant of T2R46 at position 749 or 86, or the variant of 
T2R48 at position 885); that show a novel SNP (e.g., those T2R genes with a "new" SNP indicated in 
Fagure 1); and so forth. Alternatively, gene profiles maybe further broken downby the type of bitter 
taste receptors included in the profile, for instance, those which all occur on a single chromosome 
<«*. CH 5, 7, or 12), or all of the haplotypes/isoforms of a single T2R gene. Specific contemplated 
subsets of sequences will include at least one of the following: two or more nucleotides selected 
fromSEQIDNO: 1,3,5,7,9, 11, 13, 15, 17, 19,21,23,25,27,29,31,33,35,37,39,41 43 45 
47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95 97 99 
101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 1 3 35 137 3 
141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 167, 169, 171, 173,' 1* m 19 
181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, 207, 209, 21 1, 213, 215, 21 7 ' 219 
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221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 247, 249, 251, 253, 255, 257, 259, 

261, and 263; or two or more polypeptides having a sequence selected from SEQ ID NO- 2 4 6 8 
10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60 62 
64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108 110 112 
5 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150 152 ' 
154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188 190 192 
194, 196, 198, 200, 202, 204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 23o' 232 
234, 236, 238, 240, 242, 244, 246, 248, 250, 252, 254, 256, 258, 260, 262, and 264; or at least one ' 
fragment from each of two or more of such molecules, which fragment overlaps a variant defined in 
10 any one of SEQ ID NOs: 1-46. 

The alleles/haplotypes of each bitter taste receptor included in a specific profile can be 
determined in any of various known ways, including specific methods provided herein. One 
particular contemplated method for detecting and determining the genotype and/or haplotype of 
multiple bitter taste receptors employs an array of allele-specific oligonucleotides which are used for 
qualitative and/or quantitative hybridization detection of the presence of specific alleles or SNPs in a 
sample from a subject 

Optionally, a subject's bitter taste profile can be correlated with one or more appropriate 
inhibitors or blockers of bitter taste, or other compounds that influence the ability of a subject to 
perceive a taste, which may be correlated with a control (or set of control) profile(s) condition linked 
to or associated with, for instance, sensitivity to one or a set of bitter compounds. Optionally the 
subject's bitter taste profile can be correlated with one or more appropriate treatments, for ins'tance 
treatments with compounds that inhibit or enhance the activity of one or more of the bitter taste 
alleles identified in the profile, or compositions in which the bitter taste of a component is 
specifically masked by a blocker that is added based on the information in the profile. 
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Example 8: Expression ofT2R Bitter Taste Receptor Variant Polypeptides 

The expression and purification of proteins, such as a T2R bitter taste receptor variant 
protein, can be performed using standard laboratory techniques, though these techniques are 
preferentially adapted to be fitted to express the T2R proteins. By way of example, techniques for 
expression of T2R family proteins is discussed in Wu et ah, PNAS 99:2392-2397, 2002 (incorporated 
herein by reference in its entirety). 

Additional examples of such method adaptations are discussed or referenced herein After 
expression, purified protein may be used for functional analyses, antibody production, diagnostics 
and patient therapy. Furthermore, the DNA sequences of the T2R bitter taste receptor variant cDNAs 
can be manipulated in studies to understand the expression of the gene and the function of its product 
Vanant or allelic forms of a human T2R bitter taste receptor genes may be isolated based upon 
information contained herein, and may be studied in order to detect alteration in expression patterns 
in terms of relative quantities, tissue specificity and functional properties of the encoded T2R bitter 
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taste receptor ™ian, protein ie . g ., mfhaanco m peicepaM of ^ ^ „ m 

WUCl1 - * fM *" - * °»y »• *— M express vnctoss 
M.tltods for exprasstog ,sage smounte of ^ ta . cloned gene ^ & 

»r^p»f^ lyte ^ WSB ^ mybeu ^ fMtepuii 
manual of proteins. F<)r mm ^ ^ ^ tf ^ 

to me for bacterial expression) linked to a T2R bitter taste receptor variant protein may be used 

~^»^^^™^c to .», W pb y ,fodi^ri c ^ to<I ^ 
ntetevelsof P rote»andto I ^p TOtetototoue! ^ M ^ eombyfaMo£iu()r 2 cence 

.ndnb. TT r tetamyalSObePmd ^ to ^ Wfo ' fimai °'>'' ls '»°ics. Mefoods 
fo^te^rt 111 y eC ^^' Producing fusion proteins and intact native proteins in culture are well known 
m foe art. and spectrin methods m described in Sambrook „ al. (In Ato War Oo*.: 
^W»rpMW,Cm 17, CSHL, New York, .989). Sucb fusion motems may ba made in la™, 

proband m bacteria by plaomg a steong, ragnlated printer and an efficient „Wme-b„ding site 
^^otatedg^mowbvateofp^^p^.^^ ^ 

fomtd m nteolnbie ntelusion ■todies. Mefooda for exacting proteins from foese aggmgates tun 
^dbys^katni.on^^^.. AmMyttm ^ ^ 

I . I v S1 *™ " W * " " PIeS8i '" , 0f '- Za »»° »— focluda foe P VK series 

M J i l ' ff °-40:. 8 3, ,985)^-3 (Studiaasnd 

Moffatt, y. Mol Biol 189:1 13, 1986). 

used, an snbgen. Tbe DNA sentence can slso ba mansfemed from its existmg context to otbar 
ctonmg valnclea, such as othex plss,mds, bacteriophage*, coonids, anrn., vieusea and yeas, aefflfcia. 
chro m o s „toe, ( YACa)(Buri t aa,n/..^,oa23 6 :80d.8.2, ,987). Tbeae vaotora may L bT 
tnteodttced into a varied of hosts includmg son«tic calls, and simpte or complex o^nisma sucb as 
bactena, fbngi (Timberltdce and I*** Sctou* 244:.313-,317, 1989), inverts, ptats 

^-^a-s^^w^^ ^^^^^ 
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For expression in mammalian cells, the cDNA sequence may be ligated to heterologous 
promoters, such as the simian virus (SV) 40 promoter in the P SV2 vector (Mulligan and Berg, Proc 
Natl. Acad. Sci. USA 78:2072-2076, 1981), and introduced into cells, such as monkey COS-1 cells ' 
(Gluzman, 0,7/23: 175- 182, 1981), to achieve transient or long-term expression. Thestable 
integration of the chimeric gene construct may be maintained in mammalian cells by biochemical 
selection, such as neomycin (Southern and Berg J.Mol.Appl. Genet. 1:327-341 1982) and 
mycophenolic acid (Mulligan and Berg Proc. Natl. Acad. Sci. USA 78:2072-2076, 1981) 

DNA sequences can be manipulated with standard procedures such as resection enzyme 
digestion, fill-in with DNA polymerase, deletion by exonuclease, extension by terminal * 
deoxynucleotide transferase, ligation of synthetic or cloned DNA sequences, site-directed sequence- 
alteration via single-stranded bacteriophage intermediate or with the use of specific oligonucleotides 
in combination with PCR or other in vitro amplification. 

The cDNA sequence (or portions derived from it) or a mini gene (a cDNA with an intron 
and its own promoter) may be introduced into eukaryotic expression vectors by conventional 
technique, These vectors are designed to permit the transcription of the cDNA in eukaryotic cells by 
providing regulatory sequences that initiate and enhance the transcription of the cDNA and ensure its 
proper splicing and polyadenylation. Vectors containing the promoter and enhancer regions of the 
SV40 or long terminal repeat (LTR) of the Rous Sarcoma virus and polyadenylation and splicing 
signal from SV40 are readily available (Mulligan et al, Proc. Natl. Acad. Sci. USA 78: 1078-2076 
1981; Gorman etal, Proc. Natl. Acad. Sci USA 78:6777-6781, 1982). The level of expression of 'the 
cDNA can be manipulated with this type of vector, either by using promoters that have different 
activities (for example, the baculovirus P AC373 can express cDNAs at high levels in S. Jrugiperda 
cells (Summers and Smith, In Genetically Altered Viruses and the Environ,™*, Fields et al (Eds) 
22:3 1 9-328, CSHL Press, Cold Spring Harbor, New York, 1985) or by using vectors that contain 
promoters amenable to modulation, for example, the glucocorticoid-responsive promoter from the 
mouse mammary tumor virus (Lee etal.Nature 294:228, 1982). The expression of the cDNA can 
be monitored in the recipient cells 24 to 72 hours after introduction (transient expression) 

In addition, some vectors contain selectable markers such as the gpt (Mulligan and Berg 
Proc. Natl. Acad. Sci. USA 78:2072-2076, 1981) or tieo (Southern and Berg J. Mot. Appl. Genet 
1:327-341, 1982) bacterial gene, These selectable markers permit selection of transfected cells that 
exhibit stable, long-term expression of the vectors (and therefore the cDNA). The vectors can be 
maintained in the cells as episomal, freely replicating entities by using regulatory elements of viruses 
such as papilloma (Sarver */., M>/. CellBiol. 1:486, 1981)orEpstein-Barr(Sugdene^/ Mol 
Cell Biol. 5:410, 1985). Alternatively, one can also produce cell lines that have integrated the vector 
into genomic DNA. Both of these types of cell lines produce the gene product on a continuous basis 
One can a, so Droduce cell lines ^ ^ ^ ^ rf ^ ^ ^ 

of the cDNA as well) to create cell lines that can produce high levels of the gene product (Alt et al., J 
Biol. Chetn. 253:1357, 1978). 
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The transfer of DNA into enkaryotic, in particular human or other mammalian cells, is now 
a conventional technique. The vectors are introduced into the recipient cells as pure DNA 
(transection) by, for example, precipitation with calcium phosphate (Graham and vander Eb 

52:466, 1973) or strong ' 

** USA 84:7413, 1987), DEAE dextran (McCuthan et al, J. Natl. Cancer Inst. 41-351 1968) 

USA 77.2163-2167, 1980), or pellet guns (Klein* al, Nature 327:70 , 1987) . MteMy ^ 
cDNA, or fragments thereof, can be introduced by infection with virus vectors. Systems are 
developed that use, for example, retroviruses (Bernstein et al., Gen. Engr'g 7:235, 1985), 
adenoviruses (Ahmad et al J Virnl sn-ofn ioo<\ « u . 

, osoa ™ * Q ' 611)68 V,ra8 (Spaete e< < 3 0:295, 

1982). T2R encoding sequences can also be delivered to target cells in vitro via non-infectious 
systems, for instance liposomes. 

These eukaryotic expression systems can be used for studies of T2R bitter taste receptor 
variant encoding nucleic acids and mutant forms of these molecules, T2R bitter taste receptor variant 
protems and mutant forms of these proteins. The eukaryotic expression system, may also be used to 
study the function of the normal complete protein, specific portions of the protein, or of naturaUy 
occurring or artificially produced mutant proteins. 

Using the above techniques, the expression vectors containing a T2R gene sequence or 
cDNA^ or fragments or variants or mutants thereof, can be introduced into human cells, mammalian 
cells fromother species or non-mammalian cells as desired. The choice of cell is determined by the 
purpose of the treatment. For example, monkey COS cells (Gluzmar, Cell 23:175-182, 1981) that 
produce high levels of the SV40 T antigen and permit the replication of vectors containing the SV40 
origin of replication may be used. Similarly, Chinese hamster ovary (CHO), mouse NIH 3T3 
fibroblasts or human fibroblasts or lymphoblasts may be used. 

^ PieSent *» compasses recombinant vectors that comprise all or part of the 

T2R bitter taste receptor variant gene or cDNA sequences, for expression in a suitable host. The T2R 
bitter teste receptor DNA is operatively linked in the vector to an expression control sequence in the 
recombinant DNA molecule so that a T2R bitter taste receptor polypeptide can be expressed. The 
expression control sequence may be selected from the group consisting of sequences that control the 
expression of genes of prokaryotic or eukaryotic cells and their viruses and combinations thereof 
The expression control sequence may be specifically selected from the group consisting of the lac 

lambda the control region of fd coat protein, the early and late promoters of SV40, promoters derived 
from polyoma, adenovirus, retrovirus, baculovirus and simian virus, the promoter for 
3-phosphoglycerate kinase, the promoters of veast *rM rsU^u + 

™h™ f a ^ • Phosphatase, the promoter of the yeast alpha- 

mating factors and combinations thereof. 
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One highly successful method of exoressino top >* a * • , 
nnrtinn - . . . , expressing T2R s to date is to engineer an anuno-terminal 

m«b«da f „, express T2Rs * «. „ „ „ ' ^ 

.0 iiasneceU, ^ ye ^^^ M »"'» ft «»^<»Pl«..l.« SK; <> lhmm 

expressed . to „ f „ ^ rabstotitny so]uWe &8 JJ -» »• 

Fraction of ProM„ Specific BUultngAgena 
Monoclonal or polyclonal antibodies may be produced *. . -i*. 

enitone n««™ii ^ therefore may provide a distinguishing 

protein, or vice versa, as discussed more fully herein. 

o p» tei nu»d^;rz^Tr y ^ ate8e,p ' otetaOTfomoftei -« 

cuiaooay conjugated to an enzyme such as alkaline 
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phosphatase. Application of an alkaline phosphatase substrate 5-bromo-4-chl 0 ro-3-indolyl 

Phospbate/nitro blue tetrazohum results in the production of a dense blue compound by 

— ocalfeed alkaline phosphatase. Annbodies that specmcally detect the target protein will, by 

5 ^^^^^^^^^^^^^^^ 
onthegeldeternnnedby.tsnrolecularweight). Non-specific binding of the antibody to other 
protenrsn^y occur andmaybe detectable as a weak signal on the Western blot. H,e non-specific 

W^rnblot relate to the strong primary signal arish^g from the specific antibody-target protein 

use as an nmnunogen xuay be isolated from the transfected or ixansfonned cehs as described above 
Concentrat.onof proteinor peptide inthe final preparation is adjusted, for example, by concentration 

antibody lo a» proKm can (hen be prepared ,s follows- 

15 

A. Monoclonal AnttbodyPraducdonbyHybridom. Fusion 

Monoclonal antibody to epitopes of foe targe, protein idellnr , ed ^ „ 

r, ^n^r ""^ hybrid ° ,Ms " :<:< " dtog • *• ^ ■* - 

(^ 2 5 M9 «97..975)ord^n^ tte «„ £ Briefly, a mouse b, rependvely 

z^r a r k micM8Mm ' >fa " se,ectedp ™' ei *' oveta ' , ^° fa ^^ 

nrTTr, * nab0d) '- I, ™ d " i " 8 ■* » f «« «— • H- »Pta cells are feed by 
^■T^ 1 ™ — myeloma cam, and foe excess un-fmedceu, deettoyedby 

25 offoecmtmersconumreu. Anubody-pmdu<» 8 clones am idenufled^^^ 

T~ *" ° f fc *— -> I-—* «* ae E LISA, „ origundry ^ „ y 

■We^i^a^^,,,^^,^ Selectedporiuve 
clonea can be expends end meir monoclone, anubody product bnrvetfed for Detailed 

30 Laboratory Manual, CSHL, New York, 1988). 

B. Polyclonal Antibody Producrien by Immunization 

be nteu^T" 1 ""^ C °°' >il,i ° 8 " »— «•»»- - P™*- con 

be peeped by numumzutg smufole annual, wifo fo, prorein, wbicb can be nnmodifled or 

35 ^"^^^ ^ecdvepolyclonttlandbodyprodocdon^ecMbymany 
recfocscelated^tofoe^^^ For example, smaU molecries K nd lo be J 
^^^^^m^^foeu^rf^,,,,^ 
m response to slK of mogadons end doe,, vrifo eitber inadequate or excessive doses of ana™ 
msulbng in low umr anrisem. Smtdl doses <ng lev.,, of antigen adrnimsrered « multiple mtrLerma. 
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sites appear to be most reliable. An effective innnnnization protocol for rabbits can be found in 
Vaitukauis et al. (J. Clin. Endocrinol. Metab. 33:988-991, 1971). 

titer thereof, as determined semi-quantitatively, for example, by double immunodiffusion in agar 

7a2 ^7;°™" ° f ^ to fe ^ee,for example, Oucbterlony.^dn 

Handbook of Expemnental Immunology, Wier, D. (ed.) chapter 19. Blackwell, 1973) Plateau 

Affhnty of the antisera for the antigen is determined by preparing competitive binding curves, as 
described, for example, by Fisher (Manual of Clinical Immunology, Ch. 42, 1980). 

C. Antibodies Raised against Synthetic Peptides 
A^^htomismgannbo^^ 

peptide (e . g ., a peptide that is specific to a variant T2R bitter taste receptor such as those disclosed 
herem) is to use one or more synthetic peptides synthesized on a commercially available peptide 
syn tWer based upon the predicted amino acid sequence of the protein or peptide. Polyclonal 
anhboches can be generated by injecting these peptides into, for instance, rabbits or mice 

D. Antibodies Raised by Injection of Encoding Sequence 

Antibodies may be raised against proteins and peptides by subcutaneous injection of a DNA 

heldformofmeBiolisticsystemCSanforde^/.^^/^.s,, Techju>J 5 . 27 _ 3J 
describedbyTange^/. (We 356:152-154, 1992). Expression vectors suitable for this purpose 
may mclude those that express the T2R bitter taste receptor-encoding sequence under the 
transcnption^controlof either ^^^ro^^^^^ 

Antibody preparations prepared according to these protocols are useful in quantitative 
umnunoassays which determine concentrations of antigen-bearing substances in biological samples- 
*ey are also used semi-quantitatively or qualitatively to identify the presence of antigen in a 
bunogical sample; or for immunolocalization of the specified protein. 

Optionally, antibodies, e.g., bitter taste receptor-specific monoclonal antibodies, can be 
humane by methods known in the art Antibodies with a desired binding specificity can be 
commeraally humanized (Scotgene, Scotland, UK; Oxford Molecular, Palo Alto, CA) 

E. Antibodies Specific for Specific T2R Taste Receptor Variants 

antibod.e that specially recognize these protein variants (and peptides derived therefrom) is 

Zol rr ular ' production of antibodies {and -* ■*■»— *«o mat 

^eatleastonev^antreceptorwimahigheraffh^^ 
canbeusedma M ly S1 s,d 1 agnosisandtieatinent(e.g.,inhibitiono^ 

perception), as weU as in study and examination of the T2R bitter taste receptor proteins themselves 
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In particular embodiments, it is beneficial to generate antibodies from a peptide taken from a 
vanation-specificregionofthedesiredT2Rbittertasterece P tor P rotein. By way of example, such 
regions include any peptide (usually four or more amino acids in length) that overlaps with one or 
more of the SNP-encoded variants described herein. More particularly, it is beneficial to raise 
antibodies against peptides of four or more contiguous amino acids that overlap the variants identified 
in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 40, 42, or 46, and 
particularly which comprise at least four contiguous amino acids including the residue(s) indicated in 
Figure 1 or Table7to be variable in different alleles of the specified T2R putative bitter taste 
receptors/isoforms. 

Longer peptides also can be used, and in some instances will produce a stronger or more 
reliable immunogenic response. Thus, it is contemplated in some embodiments that more than four 
amino acids are used to elicit the immune response, for instance, at least 5, at least 6, at least 8 at 
least 10, at least 12, at least 15, at least 18, at least 20, at least 25, or more, such as 30, 40, 50, or even 
longer peptides. Also, it will be understood by those of ordinary skill that it is beneficial in some 
instances to include adjuvants and other immune response enhancers, including passenger peptides or 
proteins, when using peptides to induce an immune response for production of antibodies. 

Embodiments are not limited to antibodies that recognize epitopes containing the actual 
mutation identified in each variant. Instead, it is contemplated that variant-specific antibodies also 
may each recognize an epitope located anywhere throughout the specified T2R bitter taste receptor 
variant molecule, which epitopes are changed in conformation and/or availability because of the 
activating mutation. Antibodies directed to any of these variant-specific epitopes are also 
encompassed herein. 

By way of example, the following references provide descriptions of methods for making 
antibodies specific to mutant proteins: Hills et al., {Int. J. Cancer, 63: 537-543, 1995); Reiter & 
Maihle (Nucleic Acids Res., 24: 4050-4056, 1996); Okamoto etal. (Br. J. Cancer, 73: 1366-1372 
1996);Nakayashikie/a/. s (yp M .y. Cancer Res., 91: 1035-1043, 2000); Gannon etal. (EMBOJ. 9- 
1595-1602, 1990); Wong et al. (Cancer Res., 46: 6029-6033, 1986); and Carney et al (J Cell ' 
Bioche m .,32:201-2U, 1986). Similar methods can be employed to generate antibodies specific to 
specific T2R bitter taste receptor variants. 

Example 10: Knockout and Overexpression Transgenic Animals 

Mutant organisms that under-express or over-express one or more specific alleles (isoforms) 
of one or more specific bitter taste receptor protein are useful for research. Such mutants allow 
insight into the physiological and/or psychological role of bitter taste perception in a healthy and/or 
pathological organism. These "mutants" are "genetically engineered," meaning that information in 
the form of nucleotides has been transferred into the mutant's genome at a location, or in a 
combination, in which it would not normally exist. Nucleotides transferred in this way are said to be 
"non-native >' For example, a non-bitter taste receptor promoter inserted upstream of a native bitter 
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taste receptor-encoding sequence would be non-native. An extra copy of a specific bitter taste 
receptor gene on a plasmid, transformed into a cell, would be non-native. 

Mutants may be, for example, produced from mammals, such as mice or rats, mat either 
express, over-express, or under-express a specific allelic variant or haplotype or diplotype of a 
denned bitter taste receptor (or combination of bitter taste receptors), or that do not express a 
specified receptor (or combination of receptors) at all. Over-expression mutants are made by 
increasing the number of specified genes in the organism, or by introducing a specific taste receptor 
allele into the organism under the control of a constitutive or inducible or viral promoter such as the 
mouse mammary tumor virus (MMTV) promoter or the whey acidic protein (WAP) promoter or the 
metallothionein promoter. Mutants that under-express a taste receptor, or that do not express a 
specific allelic variant of a taste receptor, may be made by using an inducible or repressible promoter, 
or by deleting the taste receptor gene, or by destroying or limiting me function of the taste receptor 
gene, for instance by disrupting the gene by transposon insertion. 

Antisense genes or molecules (such as siRNAs) may be engineered into the organism, under 
a constitutive or inducible promoter, to decrease or prevent expression of a specific T2R bitter taste 
receptor, as known to those of ordinary skill in the art 

A mutant mouse over-expressing a heterologous protein (such as a variant T2R bitter taste 
receptor protein) may be made by constructing a plasmid having a bitter taste receptor allele encoding 
sequence driven by a promoter, such as the mouse mammary tumor virus (MMTV) promoter or the 
whey acidic protein (WAP) promoter. This plasmid may be introduced into mouse oocytes by 
microinjection. The oocytes are implanted into pseudopregnant females, and the litters are assayed 
for insertion of the transgene. Multiple strains containing the transgene are then available for study. 

WAP is quite specific for mammary gland expression during lactation, and MMTV is 
expressed in a variety of tissues including mammary gland, salivary gland and lymphoid tissues. 
Many other promoters might be used to achieve various patterns of expression, e.g. , the 
metallothionein promoter. 

An inducible system may be created in which the subject expression construct is driven by a 
promoter regulated by an agent that can be fed to the mouse, such as tetracycline. Such techniques 
are well known in the art. 

A mutant knockout animal (e.g., mouse) from which a specific taste receptor gene is deleted 
can be made by removing all or some of the coding regions of the gene from embryonic stem cells 
The methods of creating deletion mutations by using a targeting vector have been described (Thomas 
and Capecchi, Cell 5 1 :503-5 12, 1987). 

35 Example 11: Knock-in Organisms 

In addition to knock-out systems, it is also beneficial to generate "knock-ins" that have lost 
expression of the native protein but have gained expression of a different, usually mutant or identified 
allelic form of the same protein. By way of example, any one or more of the allelic protein isoforms 
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V 50, 52 ' 54 > 56 > 58 > 6 0, 62, 64, 66 68 70 79 ia no 

resultant knock - ln organisms provide systems for studying taste rec^ri™ • * . 
10 taste of specific molecules is perceived. ^ h ° W ** 

TW of ordinary skill in the relevant art know methods of producing knock-in organisms 

2 ~: 7: r {mol cen 2i - ^ ~* — ■ - - ~. 

w '> Z001 )> Luo er al. {Oncogene, 20- 320-32R ?nriiVT„ 

1QOOV v v , ^ ^0-328,2001); Tomasson era/. 93: 1707-1714 

1999); Voncken era/. (, 86:4603-4611 100SV Ar,*^ . i „, , 

0 . V ' ° U;> 199 5), Andrae era/. (Mech. Dev., 107: 181-185 2001V 

Remertsene/a/. (Ge/ie^wr 6 301 114 ioo^ xx , M,/U ^' 

way ofcxamp.l - ^ "* " «** M = "Oft by 

a variant T2R bitter taste rcoepmrt, compounds to , toterac , ^ . , ... ™ > " " ^ " * ™° ° f 
a CD of a variant form of a T2R bitter taste receptor), compounds that interfere with the interact!^ f 
^~°». and * compotrnd, which modm*. *. achvity of. Mte ^ gene „ . 

mti r* Mrsenere8dato,y! ^^-™^-)»=dwhui na y 

Modulate ^Icreccpro^eitpmssio,. See, Platt, ys fo , ^ 269:28558 . 285(i2 

The compounds which may bo screened in accordance with the invention include bu..„ 
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pepndes.mcludmg but not limited to members of randompeptide libraries- (see l7 7 
Nature 354:82-84, 1991; Houghten et al, Nature 354-84 86 199. T' u ' 
- , , , ,., .'v«w**>4.84-8$ > 1991), and combinatorial chemistrv- 

organic or inorganic molecules. 

bitiertas* receptor or the activity of some othei ™ 
signal trwsduction pathway, receptor 

b *" - Having identified such a onpourf or composition, foe ^ 

fee ^!n^^^ bitter taste receptor (to reproduce the effect of an amino acid substitution sucb as 
foe eff^t^f d! >8 ' 1 * U ^ 0n ^ FTC gene (T2R38) for designing bitter taste blockeiSj or to^eproduce 
foe efiec, of foe probne substitution in the PTC gene for uesigm*, bitier b*. mhnic,, 
^^^^^inentified^ingmethods^j,,,,,^,,, ^ 

^°f»^s of.be reWoompo^d or compoaition wtth iB nature fcfo^T 
r"^" stiuctineoffoeJvlia 

— ™fo . a.,^ ^ ^ Qr ^ ^ ~y be 
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is compared to that of a "variant" of the specified taste receptor and, rather than solve the entire 
structure, the structure is solved for the protein domains that are changed. 

If an incomplete or insufficiently accurate structure is determined, the methods of computer 
based numerical modeling can be used to complete the structure or improve its accuracy. Any 
recognized modeling method may be used, including parameterized models specific to particular 
biopolymers such as proteins or nucleic acids, molecular dynamics models based on computing 
molecular motions, statistical mechanics models based on thermal ensembles, or combined models 
For most types of models, standard molecular force fields, representing the forces between 
constituent atoms and groups, are necessary, and can be selected from force fields known in physical 
chemistry. The incomplete or less accurate experimental structures can serve as constraints on the 
complete and more accurate structures computed by these modeling methods. 

Finally, having determined the structure of the active site, either experimentally, by 
modeling, or by a combination, candidate modulating compounds can be identified by searching 
databases containing compounds along with information on their molecular structure. Such a search 
1 5 seeks compounds having structures that match the determined active site structure and that interact 
with the groups defining the active site. Such a search can be manual, but is preferably computer 
assisted. These compounds found from this search are potential variant T2R bitter taste receptor 
modulating compounds. 

Alternatively, these methods can be used to identify improved modulating compounds from 
an already known modulating compound or ligand. The composition of the known compound can be 
modified and the structural effects of modification can be determined using the experimental and 
computer modeling methods described above applied to the new composition. The altered structure 
is then compared to the active site structure of the compound to determine if an improved fit or 
interaction results. In this manner systematic variations in composition, such as by varying side 
groups, can be quickly evaluated to obtain modified modulating compounds or ligands of improved 
specificity or activity. 

In another embodiment, the structure of a specified allelic taste receptor (the reference form) 
is compared to that of a variant taste receptor (encoded by a different allele of the same specified 
receptor). Then, potential bitter taste inhibitors are designed that bring about a structural change in 
the reference form so that it resembles the variant form. Or, potential bitter taste mimics are designed 
that bring about a structural change in the variant form so that it resembles another variant form, or 
the form of the reference receptor. 

Further experimental and computer modeling methods useful to identify modulating 
compounds based upon identification of the active sites of bitter compounds, various variants of the 
T2R bitter taste receptors described herein, and related transduction and transcription factors will be 
apparent to those of skill in the art. 

Examples of molecular modeling systems are the CHARMM and QUANTA programs 
(Polygen Corporation, Waltham, Mass.). CHARMm performs the energy minimization and 
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such aS BioDc»i m l lre . ( p ara ^ CaBf) ^ . ^^dable&omcoDspames 

ujn a or KNA, once that region is identified. 

could * deSCribed aW ^ referenCe t0 dCSign ^ <* confounds which 

could alter binding, one could also screen libraries of l™™,,. , ■ 

Compounds identified via assays such as those <Www u ■ 
desigiaiigbiltei taste blockers and mfaiics. 

g to; vanant T2R bitter taste receptor (including, but not limited to an BCD or a nv/rr. 
modulating the activity of "wild type" and/or "Variant" T2R bttw * > * 

The principle of assays used to identify compounds that bind to a variant T2R bitter 
receptor involves preparing a reaction mixture of a variant T2R bitter . ♦ 
testcompou^duuderconditionsandforatmiesuffi™ 

IT^n ^ P ^^^^^^-^goalofthescreenmg 
assay For example, where agonists or antagonists are sought, the full length variant T2R bitter taste 

molecule, a peptide corresponding to an BCD or » f. • • ^ 

g to an BCD or a fusion protein containing a variant T2R bitter taste 



WO 2005/007891 



PCT/US2004/019489 



-82- 



10 



15 



20 



25 



30 



35 



receptor ECD fused to a protein or polypeptide that affords advantages in the assay system (e g. 
labeling, isolation of the resulting complex, etc.) can be utilized. Where compounds that interact with 
the cytoplasmic domain are sought to be identified, peptides corresponding to a variant T2R bitter 
taste receptor CD and fusion proteins containing a variant T2R bitter taste receptor CD can be used. 

The screening assays can be conducted in a variety of ways. For example, one method to 
conduct such an assay would involve anchoring the variant T2R bitter taste receptor protein, 
polypeptide, peptide or fusion protein or the test substance onto a solid phase and detecting taste 
receptor/test compound complexes anchored on the solid phase at the end of the reaction. In one 
embodiment of such a method, the taste receptor reactant may be anchored onto a solid surface, and 
the test compound, which is not anchored, may be labeled, either directly or indirectly. 

In practice, microtiter plates may conveniently be utilized as the solid phase. The anchored 
component may be immobilized by non-covalent or covalent attachments. Non-covalent attachment 
may be accomplished by simply coating the solid surface with a solution of the protein and drying 
Alternatively, an immobilized antibody, preferably a monoclonal antibody, specific for the protein to 
be immobilized may be used to anchor the protein to the solid surface. The surfaces may be prepared 
in advance and stored. 

In order to conduct the assay, the nonimmobilized component is added to the coated surface 
containing the anchored component. After the reaction is complete, unreacted components are 
removed (e.g., by washing) under conditions such that any complexes formed will remain 
immobilized on the solid surface. The detection of complexes anchored on the solid surface can be 
accomplished in a number of ways. Where the previously nonimmobilized component is pre-labeled 
the detection of label immobilized on the surface indicates that complexes were formed. Where the ' 
previously nonimmobilized component is not pre-labeled, an indirect label can be used to detect 
complexes anchored on Ihe surface; e.g., using a labeled antibody specific for the previously 
nonimmobilized component (the antibody, in turn, may be directly labeled or indirectly labeled with a 
labeled anti-Ig antibody). 

Alternatively, a reaction can be conducted in a liquid phase, the reaction products separated 
from unreacted components, and complexes detected; e.g., using an immobilized antibody specific 
for a variant T2R bitter taste receptor protein, polypeptide, peptide or fusion protein or the test 
compound to anchor any complexes formed in solution, and a labeled antibody specific for the other 
component of the possible complex to detect anchored complexes. 

Alternatively, cell-based assays, membrane vesicle-based assays and membrane fraction- 
based assays can be used to identify compounds that interact with a variant T2R bitter taste receptor 
To this end, cell lines that express a variant T2R bitter taste receptor (or combination thereof) or cell 
hues (e.g, COS cells, CHO cells, HEK293 cells, etc.) have been genetically engineered to express 
variant T2R bitter taste receptor (e.g. , by transfection or transduction of taste receptor DNA) can be 
used. Interaction of the test compound with, for example, an ECD or a CD of a variant T2R bitter 
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taste receptor expressed by the host cell can be determined by comparison or competition with a bitter 
compound or analog thereof, such as PTC. 

A variant T2R bitter taste receptor polypeptide (such as those described herein) may be 
employed in a screening process for compounds which bind the receptor and which activate 
5 (agonists) or inhibit activation (antagonists) of the receptor polypeptide of the present invention. 
Thus, polypeptides described herein may also be used to assess the binding of small molecule 
substrates and ligands in, for example, cells, cell-free preparations, chemical libraries, and natural 
product mixtures. These substrates and ligands maybe natural substrates and ligands or maybe 
structural or functional mimetics. See Coligan et al. Current Protocols in Immunology 1 (2): Chapter 
10 5,1991. 

In general, such screening procedures involve providing appropriate cells which express a 
receptor polypeptide of the present invention on the surface thereof. Such cells include cells from 
mammals, insects, yeast, and bacteria. In particular, a polynucleotide encoding the receptor of the 
present invention is employed to transfect cells to thereby express a variant T2R bitter taste receptor. 
The expressed receptor is then contacted with a test compound to observe binding, stimulation or 
inhibition of a functional response. 

One such screening procedure involves the use of melanophores that are transfected to 
express a variant T2R bitter taste receptor. Such a screening technique is described in PCT WO 
92/01810, published Feb. 6, 1992, and incorporated herein by reference. Such an assay may be 
employed to screen for a compound which inhibits activation of a receptor of the present invention by 
contacting the melanophore cells which encode the receptor with both a receptor ligand, such as PTC 
or another bitter compound, and a compound to be screened Inhibition of the signal generated by the 
ligand indicates that a compound is a potential antagonist for the receptor, i.e, inhibits activation of 
the receptor. 

The technique may also be employed for screening of compounds which activate a receptor 
of the present invention by contacting such cells with compounds to be screened and determining 
whether such compound generates a signal, i.e., activates the receptor. 

Other screening techniques include the use of cells which express a variant T2R bitter taste 
receptor (for example, transfected CHO cells) in a system which measures extracellular pH changes 
caused by receptor activation. In this technique, compounds may be contacted with cells expressing a 
receptor polypeptide of the present invention. A second messenger response, e.g., signal transduction 
or pH changes, is then measured to determine whether the potential compound activates or inhibits 
the receptor. 

Another screening technique involves expressing a variant T2R bitter taste receptor in which 
35 the receptor is linked to phospholipase C or D. Representative examples of such cells include, but are 
not hmited to, endothelial cells, smooth muscle cells, and embryonic kidney cells. The screening 
may be accomplished as hereinabove described by detecting activation of the receptor or inhibition of 
activation of the receptor from the phospholipase second signal. 
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Another method involves screening for compounds which are antagonists, and thus inhibit 
actuation of a receptor polypeptide of the present invention by determining inhibition of binding of 
labeled ligand, such as PTC or another bitter compound, to cells which have the receptor on the 
surface thereof, or cell membranes containing the receptor. Such a method involves transfecting a 
eukaryotic cell with a DNA encoding a variant T2R bitter taste receptor such that the cell expresses 
the receptor on its surface, or using of eukaryotic cells that express the receptor of the present 
invennon on their surface (or using a eukaryotic cell that expresses the receptor on its surface) The 
cell is then contacted with a potential antagonist in the presence of a labeled form of a ligand, such as 
PTCoranotherbittercompound. The ligand can be labeled, e.g., by radioactivity. Theamountof 
labeled ligand bound to the receptors is measured, e.g. , by measuring radioactivity associated with 
transfected cells or membrane from these cells. If the compound binds to the receptor, the binding of 
labeled hgand to the receptor is inhibited as determined by a reduction of labeled ligand that binds to 
the receptors. This method is called a binding assay. 

Another such screening procedure involves the use of eukaryotic cells, which are transfected 
to express the receptor of the present invention, or use of eukaryotic cells that express the receptor of 
the present invention on their surface. The cells are loaded with an indicator dye that produces 
fluorescent signal when bound to calcium, and the cells are contacted with a test substance and 
receptor agonist, such as PTC or another bitter compound. Any change in fluorescent signal is 
measured over a defined period of time using, for example, a fluorescence spectrophotometer or a 
fluorescence imaging plate reader. A change in the fluorescence signal pattern generated by the 
hgand nKhcates that a compound is a potential antagonist (or agonist) for the receptor. 

Another such screening procedure involves use of eukaryotic cells, which are transfected to 
express the receptor of the present invention (or use of eukaryotic cells that express the receptor of 
the present invention), and which are also transfected with a reporter gene construct that is coupled to 
actuation of the receptor (for example, luciferase or beta-galactosidase behind an appropriate 
promoter). The ceUs are contacted with a test substance and a receptor agonist, such as PTC or 
another bitter compound, and the signal produced by the reporter gene is measured after a defined 
penodoftime. The signal can be measured using a luminometer, spectrophotometer, fluorimeter or 
other such mstrument appropriate for the specific reporter construct used. Inhibition of the signal' 
30 generated by the hgand indicates that a compound is a potential antagonist for the receptor. 

Another such screening technique for antagonists or agonists involves introducing RNA 
encoding a PTC taste receptor into Xenopus oocytes to transiently express the receptor. The receptor 
expressmg oocytes are then contacted with a receptor hgand, such as PTC, and a compound to be 
screened. Inhibition or activation of the receptor is then determined by detection of a signal, such as 
35 cAMP, calcium, proton, or other ions. 

Another such technique of screening for antagonists or agonists involves determining 
inlubition or stimulation of T2R taste receptor-mediated cAMP and/or adenylate cyclase 
accumulation or diminution. Such a method involves transiently or stably transfecting a eukaryotic 
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cell with a variant T2R bitter taste receptor to express the receptor on the cell surface (or using a 
eukaryotxc cell that expresses the receptor of the present invention on its surface). The cell is then 
exposed to potential antagonists in the presence of ligand, such as PTC or another bitter compound 
The amount of cAMP accumulation is then measured, for example, by radio-immuno or protein 
bmdmg assays (for example using Flashplates or a scintillation proximity assay). Changes in cAMP 
levels can also be determined by direcdy measuring the activity of the enzyme, adenylyl cyclase in 
broken cell preparations. If the potential antagonist binds the receptor, and thus inhibits taste receptor 
binding, the levels of variant T2R bitter taste receptor-mediated cAMP, or adenylate cyclase activity 
will be reduced or increased. Additional techniques for examining the activity of G-protein receptor' 
pathways, and components therein, are known to those of ordinary skill in the art. 

Example!* Assays for Intracellular Proteins that Interact mth a Variant T2R Bitter Taste 



Any method suitable for detecting protein-protein interactions may be employed for 
identifying transmembrane proteins or intraceUular proteins that interact with a variant T2R bitter 
taste receptor. Among the traditional methods which may be employed are co-immunoprecipitation, 
crosslinking and co-purification through gradients or chromatographic columns of cell lysates or 
protems obtained from cell lysates and a variant T2R bitter taste receptor to identify proteins in the 
lysate that interact with the PTC taste receptor. For these assays, a variant T2R bitter taste receptor 
component used can be a full length taste receptor, a soluble derivative lacking the membrane- 
anchonng region {e.g., a truncated taste receptor in which all TMDs are deleted resulting in a 
truncated molecule containing ECDs fused to CDs), a peptide corresponding to a CD or a fusion 
protein containing a CD of PTC taste receptor. 

Once isolated, such an intracellular protein can be identified and can, in turn, be used in 
conjunction with standard techniques, to identify proteins with which it interacta. Forexample at 
least a portion of the amino acid sequence of an intracellular protein which interacts with the variant 
T2R bitter taste receptor can be ascertained using techniques well known to those of skill in the art 
such as via the Edman degradation technique. See, e. g ., Creighton Protems: Structures and ' 
Molecular Principles, W.H. Freeman & Co., N.Y., pp. 34-49, 1983. The amino acid sequence 
obtained may be used as a guide for the generation of oligonucleotide mixtures that can be used to 
screen for gene sequences encoding such intracellular proteins. Screening may be accomplished for 
example, by standard hybridization or PGR techniques. Techniques for the generation of 
oligonucleotide mixtures and the screening are well known. See, e.g., Ausubel et al. Current 
Protocols in Molecular Biology Green Publishing Associates and Wiley Interscience, N Y 1989- and 
^9*90 CdS ' A to Metk0dS andA PP l ™*>™ Academic Press, Inc., New 

Additionally, methods may be employed which result in the simultaneous identification of 
genes, which encode the transmembrane or intraceUular proteins interacting with a variant T2R bitter 
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taste receptor. These methods include, for example, probing expression libraries, in a manner similar 
to the well known technique of antibody probing of Xgtl 1 libraries, using labeled PTC taste receptor 
protein, or a variant T2R bitter taste receptor polypeptide, peptide or fusion protein, e.g. , a variant 
T2R bitter taste receptor polypeptide or PTC taste receptor domain fused to a marker (e.g., an 
enzyme, fluor, luminescent protein, or dye), or an Ig-Fc domain. 

One method that detects protein interactions in vivo, the two-hybrid system, is described in 
detail for illustration only and not by way of limitation. One version of this system has been 
described (Chien et aL, PNAS USA 88:9578-9582, 1991) and is commercially available from 
Clontech (Palo Alto, Calif). 

Briefly, utilizing such a system, plasmids are constructed that encode two hybrid proteins: 
one plasmid consists of nucleotides encoding the DNA-binding domain of a Iranscription activator 
protein fused to a variant T2R bitter taste receptor nucleotide sequence encoding a variant T2R bitter 
taste receptor, a variant T2R bitter taste receptor polypeptide, peptide or fusion protein, and the other 
plasmid consists of nucleotides encoding the transcription activator protein's activation domain fused 
to a cDNA encoding an unknown protein which has been recombined into this plasmid as part of a 
cDNA library. The DNA-binding domain fusion plasmid and the cDNA library are transformed into 
a strain of the yeast Saccharomyces cerevisiae that contains a reporter gene (e.g., HBS or lacZ) 
whose regulatory region contains the transcription activator's binding site. Either hybrid protein 
alone cannot activate transcription of the reporter gene: the DNA-binding domain hybrid cannot 
because it does not provide activation function and the activation domain hybrid cannot because it 
cannot localize to the activator's binding sites. Interaction of the two hybrid proteins reconstitutes the 
functional activator protein and results in expression of the reporter gene, which is detected by an 
assay for the reporter gene product. 

The two-hybrid system or related methodology may be used to screen activation domain 
libraries for proteins that interact with the "bait" gene product. By way of example, and not by way 
of limitation, PTC taste receptor may be used as the bait gene product. Total genomic or cDNA 
sequences are fused to the DNA encoding an activation domain. This library and a plasmid encoding 
a hybrid of a bait variant T2R bitter taste receptor gene product fused to the DNA-binding domain are 
cotransformed into a yeast reporter strain, and the resulting transformants are screened for those that 
express the reporter gene. For example, and not by way of limitation, a bait variant T2R bitter taste 
receptor gene sequence, such as the open reading frame of variant T2R bitter taste receptor (or a 
domain of a taste receptor) can be cloned into a vector such that it is translationaUy fused to the DNA 
encoding the DNA-binding domain of the GAL4 protein. These colonies are purified and the library 
plasnuds responsible for reporter gene expression are isolated. DNA sequencing is then used to 
35 identify the proteins encoded by the library plasmids. 

A cDNA library of the cell line from which proteins that interact with bait variant T2R bitter 
taste receptor gene product are to be detected can be made using methods routinely practiced in the 
art. According to the particular system described herein, for example, the cDNA fragments can be 
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inserted into a vector such that they are translationally fused to the transcriptional activation domain 
of QAL*. This library can be co-transformed along with the bait PTC taste receptor gene-GAD* 
fusion plasmid into a yeast strain, which contains a lacZ gene driven by a promoter that contains 
GAL4 activation sequence. A cDNA encoded protein, fused to GAL4 transcriptional activation 
domain, that interacts with bait PTC taste receptor gene product will reconstitute an active GAL4 
protein and thereby drive expression of the HIS3 gene. Colonies, which express HIS3, can be 
detected by their growth on Petri dishes containing, semi-solid agar based media lacking histidine. 
The cDNA can then be purified from these strains, and used to produce and isolate the bait PTC taste 
receptor gene-interacting protein using techniques routinely practiced in the art 



Example 15: Assays for Compounds that Interfere with Taste Receptor /Intracellular or Taste 
Receptor /Transmembrane Macromolecule Interaction 

The macromolecules that interact with a variant T2R bitter taste receptor are referred to, for 
purposes of this discussion, as "binding partners". These binding partners are likely to be involved in 
a variant T2R bitter taste receptor signal transduction pathway, and therefore, in the role of taste 
receptors and taste receptor variants in bitter tasting. Therefore, it is desirable to identify compounds 
that interfere with or disrupt the interaction of such binding partners with variant and/or normal T2R 
bitter taste receptor, which may be useful in regulating the activity of variant T2R bitter taste 
receptors and control the sensitivity to bitter tastes associated with certain taste receptor activity. 

The basic principle of the assay systems used to identify compounds that interfere with the 
interaction between a variant T2R bitter taste receptor and its binding partner or partners involves 
preparing a reaction mixture containing variant T2R bitter taste receptor protein, polypeptide, peptide 
or fusion protein as described above, and the binding partner under conditions and for a time 
sufficient to allow the two to interact and bind, thus forming a complex. In order to test a compound 
25 for inhibitory activity, the reaction mixture is prepared in the presence and absence of the test 

compound. The test compound may be initially included in the reaction mixture, or may be added at 
a time subsequent to the addition of a variant T2R bitter taste receptor moiety and its binding partner. 
Control reaction mixtures are incubated without the test compound or with a placebo. The formation 
of any complexes between a variant T2R bitter taste receptor moiety and the binding partner is then 
30 detected. The formation of a complex in the control reaction, but not in the reaction mixture 

containing the test compound, indicates that the compound interferes with the interaction of a variant 
T2R bitter taste receptor and the binding partner. Additionally, complex formation within reaction 
mixtures containing the test compound and reference T2R bitter taste receptor variant may also be 
compared to complex formation within reaction mixtures containing the test compound and a 
different allelic or other variant of the same T2R taste receptor. This comparison may be important in 
those cases wherein it is desirable to identify compounds that disrupt interactions of reference but not 
variant T2R taste receptors, or differentially disrupt interactions between different variant T2R taste 
receptors. 
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The assay for compounds that interfere with the interaction of a variant T2R bitter taste 
receptor and binding partners can be conducted in a heterogeneous or homogeneous format. 
Heterogeneous assays involve anchoring either a variant T2R bitter taste receptor moiety product or 
the binding partner onto a solid phase and detecting complexes anchored on the solid phase at the end 
of the reaction. In homogeneous assays, the entire reaction is carried out in a liquid phase. In either 
approach, the order of addition of reactants can be varied to obtain different information about the 
compounds being tested. For example, test compounds that interfere with the interaction by 
competition can be identified by conducting the reaction in the presence of the test substance; i.e., by 
adding the test substance to the reaction mixture prior to or simultaneously with a variant T2R bitter 
taste receptor moiety and interactive binding partner. Alternatively, test compounds that disrupt 
preformed complexes, e.g., compounds with higher binding constants that displace one of the 
components from the complex, can be tested by adding the test compound to the reaction mixture 
after complexes have been formed. The various formats are described briefly below. 

In a heterogeneous assay system, either a variant T2R bitter taste receptor moiety or the 
interactive binding partner, is anchored onto a solid surface, while the non-anchoied species is 
labeled, either directly or indirectly. In practice, microtiter plates are conveniently utilized. The 
anchored species may be immobilized by non-covalent or covalent attachments. Non-covalent 
attachment may be accomplished simply by coating the solid surface with a solution of a variant T2R 
bitter taste receptor gene product or binding partner and drying. Alternatively, an immobilized 
antibody specific for the species to be anchored may be used to anchor the species to the solid 
surface. The surfaces may be prepared in advance and stored. 

In order to conduct the assay, the partner of the immobilized species is exposed to the coated 
surface with or without the test compound. After the reaction is complete, unreacted components are 
removed (e.g., by washing) and any complexes formed will remain immobilized on the solid surface. 
The detection of complexes anchored on the solid surface can be accomplished in a number of ways 
Where the non-immobilized species is pre-labeled, the detection of label immobilized on the surface 
indicates that complexes were formed. Where the non-immobilized species is not pre-labeled, an 
indirect label can be used to detect complexes anchored on the surface; e.g., using a labeled antibody 
specific for the initially non-immobilized species (the antibody, in turn, may be directly labeled or 
indirectly labeled with a labeled anti-Ig antibody). Depending upon the order of addition of reaction 
components, test compounds which inhibit complex formation or which disrupt preformed complexes 
can be detected. 

Alternatively, the reaction can be conducted in a liquid phase in the presence or absence of 
the test compound, the reaction products separated from unreacted components, and complexes 
detected; e.g., using an immobilized antibody specific for one of the binding components to anchor 
any complexes formed in solution, and a labeled antibody specific for the other partner to detect 
anchored complexes. Again, depending upon the order of addition of reactants to the liquid phase 
test compounds which inhibit complex or which disrupt preformed complexes can be identified. 
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In an alternate embodiment of the invention, a homogeneous assay can be used In this 
approach, a preformed complex of a variant T2R bitter taste receptor moiety and the interactive 
faodog partner is prepared in which either a variant T2R bitter taste receptor or its binding partner, is 
labeled, but the signal generated by the label is quenched due to formation of the complex (see . g . 
™.PatNo.4,109,496byRub^ ^ ' *' 

addxuon of a test substance that competes with and displaces one of the species from the preformed 
complex will result in the generation of a signal above background. In this way, test substances 
which disrupt PTC taste receptor/intracellular binding partner interaction can be identified ' 

In a particular embodiment, a variant T2R bitter taste receptor fusion can be prepared for 
nmnobmzation. For example, a variant T2R bitter taste receptor or a peptide fragment, e g . 
corresponding to a CD, can be fused to a glutathione-S-transferase (GST) gene using a fusion vector 
such as pGEX-SX-l, in such a manner that its binding activity is maintained in the resulting fusion ' 
protein. The interactive binding partner can be purified and used to raise a monoclonal antibody 
usmg methods routinely practiced in the art and described above. This antibody can be labeled with 
the radioactive isotope for example, by methods routinely practiced in the art Ina 
heterogeneous assay, e. S ., the GST-taste receptor fusion protein can be anchored to glutathione- 
agarose beads. The interactive binding partner can then be added in the presence or absence of the 
test compound in a manner that allows interaction and binding to occur. At the end of the reaction 
penod, unbound material can be washed away, and the labeled monoclonal antibody can be added to 
tte system and allowed to bind to the complexed components. The interaction between a variant T2R 
brtter taste receptor gene product and the interactive binding partner can be detected by measuring the 
amount of radioactivity that remains associated with the glutathione-agarose beads. Asuccessful 
mhibrtion of the interaction by the test compound will result in a decrease in measured radioactivity 
Alternatively, the GST-taste receptor fusion protein and the interactive binding partner can 
be ^ together in liquid in the absence of the solid glutathione-agarose beads. The test compound 
can be added erther during or after the species are allowed to interact This mixture can then be 
added to the glutathione-agarose beads and unbound material is washed away. Again the extent of 
* b ;; fa Vadant ^ ^ teSte -eptor/binding partner interaction can be detected by adding 
the labeled antibody and measuring the radioactivity associated with the beads 

In another embodiment, these same techniques can be employed using peptide fragments 
that correspond to the binding domains of a variant T2R bitter taste receptor and/or the interactive or 
bmdmg partner (in cases where the binding partner is a protein), in place of one or both of the full 
length proteins. Any number of methods routinely practiced in the art can be used to identify and 
xsolate the binding sites. These methods include, but are not limited to, mutagenesis of the gene 
encoding one of the proteins and screening for disruption of binding in a co-immuno P recipitation 
assay. Compensating mutations in the gene encoding the second species in the complex can then be 
selected. Sequence analysis of the genes encoding the respective proteins will reveal the mutations 
that correspond to the region of the protein involved in interactive binding. Alternatively, one protein 
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an b e anchored * , . solid surface usirtg rntfhods desclibed aW> ^ ^ „ ^ 
bred fo its labeled binding partner, ^ ch to ^ teated ^ , 

Aching,, short, ^Pepndeeon»rt^n„bn^ 8 domnn,„u yrem ^ a ^ ted ™ 
•to sohd tnateriu, which » ta isolaKd md i<fcntffied by ^ ^ once to 

gene coding for the intraceliultn binding partne, is obtain* gene segues can be engineered 
» express pepbde fregnrente of ft. ptofere, which can fhen be tested for binding acdvif, andpurffied 
or synthesized. 

F »^l=.«»o»o.byw. y „f ltaiMo ^ av ^ tmbitolasKre<!eptor 
oan b. anchored fo a solid materta! as deacrtbed, above, by rooting a GST-tesfe recepfos fusion 
protetn * Rowing i, „ bmd ,„ gfutadnone agtnose be.de. The reterachve biuthng partner can be 
Ubeted mm a radtoacdve isofope, such as »s, and cleaved with . p^ync ^ such „ 
Cleavage produce can then be added to the anchored GST-teste receptee fusion pro«ein and aZL 

btndtng partes, btedntg domain, can be eluted, purified, and analysed for annuo acid sconce by 
wefMcnown Pepfides ao fdenfified can be preduced syndtedcally or fused to appropriate 

facultative proteins using recombinant DNA technology. 

E»mph 16: Assays for Identification of Compound, that Modulate maer Tastes 

tho A ^r^'^^^^^^^^^^^aasay^hre^ssnch^ 
fi».e descrtbed above, oan be tested tor fhe .bitty to bitter ^ ^ 

that bind to a variant T2R hitter teste receptor, hddbi. binding of fhe nafurtd ^ ^ 
activate sign* frensduedon (agoniste, „ bloct ^ ^ 

8 ^T"™^^^--^*-.*** or compounds fh., affect 
v^^b^^^aptorg^^^,^^^^^ 

mcndreg molecules, eg., proteins „ « „^ 0 moleciues, „», ^ „ ^ °" 
to expression of dae fid. length vtuian, „, wfid-type rabifter teste receptor » bo roared) 
However^, should be noted dsa, the assays deacrtbed can also identify compounds tha, modulate 
vmfan, T2R bttter teste receptor signal transdueu™ (eg., which tdfecfdovroatream 

rtgnabng a™*,, .«„ as inhibitors „, ed,„ce„ of protein fduasos „, p^^ activities wbich 

T ttri Muo i* es ^ actt ™ ttdiyb ^ ofaMn °^°° a ^."' c )'<'"«ia W 

■""^ ^ — ^""^""'o.pad.way in v A ,ohav^ a .,raRbi«er teste 

"^^"^Tmbteteate^^p,^^^ 

rriT ^:!" eff " ,0f ^' rab '»"^«^»-'>«-^ to b,«ertestea 
for modulating bitter tastes. 
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Cell-based systems, membrane vesicle-based systems and membrane fraction-based systems 
can be used to identify compounds that may act to modulate bitter tastes. Such cell systems can 
include, for example, recombinant or non-recombinant cells, such as cell lines, which express the 
PTC taste receptor gene. In addition, expression host cells {e.g., COS cells, CHO cells, HEK293 

5 cells) genetically engineered to express a functional variant T2R bitter taste receptor and to respond 
to activation by the natural ligand, e.g., as measured by a chemical or phenotypic change, induction 
of another host cell gene, change in ion flux (e.g., Ca 2+ ), phosphorylation of host cell proteins, etc., 
can be used as an end point in the assay. 

In utilizing such cell systems, cells may be exposed to a compound suspected of exhibiting 

10 an ability to modulate bitter tastes, at a sufficient concentration and for a time sufficient to elicit such 
a modulation in the exposed cells. After exposure, the cells can be assayed to measure alterations in 
the expression of a variant T2R bitter taste receptor gene, e.g., by assaying cell lysates for PTC taste 
receptor mRNA transcripts (e.g. , by Northern analysis) or for variant T2R bitter taste receptor protein 
expressed in the cell; compounds which regulate or modulate expression of a variant T2R bitter taste 

15 receptor gene are good candidates as therapeutics. Alternatively, the cells are examined to determine 
whether one or more cellular phenotypes has been altered to resemble a taster or nontaster type. Still 
further, the expression and/or activity of components of the signal transduction pathway of which a 
variant T2R bitter taste receptor is a part, or the activity of a T2R bitter taste receptor signal 
transduction pathway itself can be assayed. 

20 For example, after exposure, the cell lysates can be assayed for the presence of 

phosphorylation of host cell proteins, as compared to lysates derived from unexposed control cells. 
The ability of a test compound to inhibit phosphorylation of host cell proteins in these assay systems 
indicates that the test compound alters signal transduction initiated by taste receptor activation. The 
cell lysates can be readily assayed using a Western blot format; i.e., the host cell proteins are resolved 

25 by gel electrophoresis, transferred and probed using a detection antibody (e.g., an antibody labeled 

with a signal generating compound, such as radiolabel, fluor, enzyme, etc.), see, e.g., Glenney et al, J 
Immunol Methods 109:277-285, 1988; Frackelton et al, Mol Cell Biol 3:1343-1352, 1983. 
Alternatively, an ELISA format could be used in which a particular host cell protein involved in the 
taste receptor signal transduction pathway is immobilized using an anchoring antibody specific for 

30 die target host cell protein, and the presence or absence of a phosphorylated residue on the 

immobilized host cell protein is detected using a labeled antibody. (See, e.g., King et al, Life Sci 
53:1465-1472, 1993). 

In yet another approach, ion flux, such as calcium ion flux, can be measured as an end point 
for PTC taste receptor stimulated signal transduction. Calcium flux can be measured, for instance, 
35 using calcium-sensitive dyes or probes, many of which are known to those of ordinary skill in the art. 
Examples of ion sensitive fluorophores include, but are not limited to, bis-(l,3-dibutylbarbituric 
acid)trimethine oxonol (DiBAC4(3) (B-438), Quin-2 (AM Q-1288), Fura-2 (AM F-1225), Indo-1 
(AM 1-1226), Fura-3 (AM F-1228), Fluo-3 (AM F-1241), Rhod-2, (AM R-1244), BAPTA (AM B- 
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1205), S.S'-dimethyl BAPTA (AM D-1207), 4,4'-difluoro BAPTA (AM D-12 16), 5,5'-difluoro 
BAPTA (AM D-1209), 5,5'-dibromo BAPTA (AM D-1213), Calcium Green (C-301 1), Calcium 
Orange (C-3014), Calcium Crimson (C-3017), Fura-5 (F-3023), Fura-Red (F-3020), SBFI (S-1262) 
PBFI (P-1265), Mag-Fura-2 (AM M-1291), Mag-Indo-1 (AM M-1294), Mag-Quin-2 (AM M-1299^ 
Mag-Quin-1 (AM M-1297), SPQ (M-440), SPA (S-460), Calcien (Fluorescein- 
bis(memyliminodiacetic acid); Fluorexon), and Quin-2 (2-{[2-Bis-(carboxymemyl)amino-5- 
memylphenoxy]-memyl}-6^^^ tetrapotassium salt). 

Example 17: Other Assays for Modulators of Variant T2R Bitter Taste Receptors 
A. Assays for Taste Receptor Protein Activity 

T2R bitter taste receptor family members are G-protein coupled receptors that participate in 
taste transduction, e.g., bitter taste transduction. The activity of a T2Rbitter taste receptor protein 
variants can be assessed using a variety of in vitro and in vivo assays to determine functional, 
chemical, and physical effects, e.g., measuring ligand binding ( e . g . , radioactive ligand binding) 
second messengers (e.g., cAMP, cGMP, n» 3 , DAG, or Ca~>, ion flux, phosphorylation levels ' 
transcription levels, neurotransmitter levels, and the like. Furthermore, such assays can be used to 
test for mhibitors and activators of identified T2R bitter taste receptor family member variants 
Modulators can also be genetically altered versions of taste receptors. Such modulators of taste 
transduction activity are useful for customizing taste, for example to modify the detection of bitter 



Modulators of a T2R bitter taste receptor protein variant activity are tested using taste 
receptor polypeptides as described herein, either recombinant or naturally occurring. The protein can 
be elated, expressed in a cell, expressed in a membrane derived from a cell, expressed in tissue or in 
an animal, either recombinant or naturally occurring. For example, tongue slices, dissociated cells 
from a tongue, transformed cells, or membranes can be used. Modulation is tested using one of the in 
vitro or in vivo assays described herein. Taste transduction can also be examined in vitro with soluble 
or solid state reactions, using a full-length taste receptor or a chimeric molecule such as an 
extracellular domain or transmembrane domain, or combination thereof, of a taste receptor variant 
covalently linked to a heterologous signal transduction domain, or a heterologous extracellular 
domam and/or transmembrane domain covalently linked to the transmembrane and/or cytoplasmic 
domain of a T2R bitter taste receptor protein variant. Furthermore, ligand-binding domains of the 
protem of interest can be used in vitro in soluble or solid state reactions to assay for ligand binding 
In numerous embodiments, a chimeric receptor will be made that comprises all or part of a T2R bitter 
taste receptor protein variant as well an additional sequence that facilitates the localization of the taste 
receptor to the membrane, such as a rhodopsin, e.g., an N-terminal fragment of a rhodopsiu protein 

Ligand binding a T2R bitter taste receptor protein variant, a domain, or chimeric protein can 
be tested in solution, in a bilayer membrane, attached to a solid phase, in a lipid monolayer or in 
vesicles. Binding of a modulator can be tested using, e.g., changes in spectroscopic characteristics 
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(**., fluorescence, absorbance, refractive index) hydxodynamic (e. g ., shape), chromatographic, or 
solubility properties. 

Receptor-G-protein interactions can also be examined. For example, binding of the G- 
protein to the receptor or its release from the receptor can be examined. For example, in the absence 
of GTP, an activator will lead to the formation of a tight complex of a G protein (all three known 
subnnits) with the receptor. Ibis complex can be detected in a variety of ways, as noted above. Such 
an assay can be modified to search for inhibitors, e.g., by adding an activator to the receptor and G 
protein in the absence of GTP, which form a tight complex, and then screen for inhibitors by looking 
at dissociation of the receptor-G protein complex. In the presence of GTP, release of the known 
alpha subunit of the G protein from the other two known G protein subnnits serves as a criterion of 
activation. 

In a convenient embodiment, T2R bitter taste receptor protein variant-gustducin interactions 
are momtored as a function of taste receptor activation. One taste-cell specific G protein that has 
been idenbfied is called gustducin (McLaughlin et al. Nature 357:563-569, 1992). Such ligand 
dependent coupling of taste receptors with gustducin can be used as a marker to identify modifiers of 
the T2R bitter taste receptor protein variant. 

An activated or inhibited G-protein will in turn alter the properties of target enzymes 
channels, and other effector proteins. The classic examples are the activation of cGMP 
phosphodiesterase by transducin in the visual system, adenylate cyclase by the stimulatory G-protein, 
phosphohpase C by Gq and other cognate G proteins, and modulation of diverse channels by Gi and 
other G proteins. Downstream consequences can also be examined such as generation of diacyl 
glycerol and IP3 by phosphohpase C, and in turn, for calcium mobilization by IP3. 

In a convenient embodiment, a T2R bitter taste receptor protein variant is expressed in a 
eukaryohc cell as a chimeric receptor with a heterologous, chaperone sequence that facilitates its 
maturate and targeting through the secretory pathway, m a preferred ernbodiment, the heterologous 
sequence is a rhodopsin sequence, such as anN-terminal leader of a rhodopsh, Such chimeric taste 
receptors can be expressed in any eukaryotic cell, such as HEK293 cells. Preferably the cells 
comprise a functional G protein, e. g ., Gal 5, that is capable of coupling the chimeric receptor to an 
mtracellular signaling pathway or to a signaling protein such as phosphohpase C P . Activation of 
such chimeric receptors in such cells can be detected using any standard method, such as by detecting 
changes in mtracellular calcium by detecting FURA-2 dependent fluorescence in the cell. 

An activated G-protein coupled receptor (GPCR) becomes a substrate for kinases that 
phosphorylate the C-terminal tail of the receptor (and possibly other sites as well). Thus, activators 
wrll promote the transfer of «B from gamma-labeled GTP to the receptor, which can be assayed with 
a scmnllation counter. The phosphorylation of the C-terminal tail will promote the binding of 
arrestin-like proteins and will interfere with the binding of G-proteins. The kinase/arrestin pathway 
Plays a key role in the desensitization of many GPCR receptors. For example, compounds that 
modulate the duration a taste receptor stays active would be useful as a means of prolonging a desired 
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taste or cutting off an unpleasant one. For a general review of GPCR signal transduction and 

methods of assaying signal transduction, see, , g ., Methods in Ecology, vols. 237 and 238 (1994) 

and volume 96 (1983); Bourne era/., iVaft/re 10-349117 127 looi. b , . 

,„,„.,,, , t amre W - 549A17 - 127 ' 19 ^^omn & etal.. Nature 348:125- 

uz, 1990; Pxtcher et al, Annu Rev Biochem 67:653-692, 1998. 

Samples or assays that are treated with a potential T2R bitter taste receptor protein variant 
***** or activator are compared to control samples without the test compound, to examine the 
extent o modulation. Such assays may be carried out in the presence of a bitter tastant that is known 
to activate the particular receptor, and modulation of the bitter-tastant-dependent activation 
monitored. Control samples (untreated with activators or inhibitors) are assigned a relative T2R 
bitter taste receptor protein activity value of 100. Inhibition of a T2R bitter taste receptor protein 
variant is achieved when the T2R bitter taste receptor protein variant activity value relative to the 
control* about 90%, optionally 5 0 o/ 0 , opuonafly 25^%. Activation ofaKR bitter taste receptor 

the control is 110%, optionally 150%, 200-500%, or 1000-2000%. 

po ennal) of the cell or membrane expressing a T2R bitter taste receptor protein variant. One means 
to determine changes in cellular polarization is by measuring changes in current (thereby measuring 
changes ; in polarization) with voltage-clamp and patch-clamp techniques, e. g ., the "cell-attached" 
mode the "inside-out" mode, and the "whole cell" mode (see, , g , Ackerman et al, New Engl JMed 
336:1575-1595, 1997). Whole cell „ are conveniendy determined using die standard 
"T^^^^^ 1 ^^^ 1981). Other knownassays include- 
radxolabeled ion flux assays and fluorescence assays using voltage-sensitive dyes (see e g. 
Veste^ 

277 1997; Darnel etal, J Pharmacol Meth 25.185-193, 1991; Holevinsky et al, J Membrane 

teTjmT 9 " 7 ^ 

™ eeffec *°™etestcomp^^ 
exannnmganyoftheparametersdescnWa 

GPCRactivitycanbe used to assess the influence ofatest compound on the polypeptides of this 
nivention. When the functional consequences are determined using intact cells or animals, one can 
also „ variety of effects such as ^^^ }m ^ 1mmj ^ 
to both known and ^characterized genetic markers (e. g , northern blots), changes in cell metabolil 

Convenient assays for G-protein coupled receptors include cells that are loaded with ion or 
voltage sensrhve dyes to report receptor activity. Assays for deterniining activity of such receptors 
can also use known agonists and antagonists for other G-protein coupled receptors as negative or 
positive controls to assess activity of tested compounds. In assays for identifying modulatory 
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compounds {e.g., agonists, antagonists), changes in the level of ions in the cytoplasm or membrane 
voltage will be monitored using an ion sensitive or membrane voltage fluorescent indicator, 
respectively. Among the ion-sensitive indicators and voltage probes that may be employed are those 
disclosed in the Molecular Probes 1997 Catalog. For G-protein coupled receptors, promiscuous G- 
5 proteins such as Gal5 and G<xl6 can be used in the assay of choice (Wilkie et al. PNAS USA 

88:10049-10053, 1991). Such promiscuous G-proteins allow coupling of a wide range of receptors. 

Receptor activation typically initiates subsequent intracellular events, e.g., increases in 
second messengers such as IP3, which releases intracellular stores of calcium ions. Activation of 
some G-protein coupled receptors stimulates the formation of inositol triphosphate (IP3) through 

10 phospholipase C-mediated hydrolysis of phosphatidylinositol (Berridge & Irvine Nature, 312-315- 
321, 1984). IP3inturnstimulates the release of intraceUular calcium ion stores. Thus, a change in 
cytoplasmic calcium ion levels, or a change in second messenger levels such as IPS can be used to 
assess G-protein coupled receptor function. Cells expressing such G-protein coupled receptors may 
exhibit increased cytoplasmic calcium levels as a result of contribution from both intracellular stores 

15 and via activation of ion channels, in which case it may be desirable although not necessary to 

conduct such assays in calcium-free buffer, optionally supplemented with a chelating agent such as 
EGTA, to distinguish fluorescence response resulting from calcium release from internal stores. 

Other assays can involve determining the activity of receptors which, when activated, result 
in a change in the level of intracellular cyclic nucleotides, e.g., cAMP or cGMP, by activating or 

20 inhibiting enzymes such as adenylate cyclase. There are cyclic nucleotide-gated ion channels e g. 
rod photoreceptor cell channels and olfactory neuron channels that are permeable to cations upon ' 
activation by binding of cAMP or cGMP (see, e.g., Altenhofen etal, PNAS USA 88:9868-9872 
1991; andDhallanrf*/., Afefcre 347:184-187, 1990). Incases where activation of the receptor' 
results in a decrease in cyclic nucleotide levels, it may be preferable to expose the cells to agents that 

25 mcrease intracellular cyclic nucleotide levels, e.g., forskolin, prior to adding a receptor-activating 
compound to the cells in the assay. Cells for this type of assay can be made by co-transfection of a 
host cell with DNA encoding a cyclic nucleotide-crated ion channel, GPCR phosphatase and DNA 
encoding a receptor {e.g., certain glutamate receptors, muscarinic acetylcholine receptors, dopamine 
receptors, serotonin receptors, and the like), which, when activated, causes a change in cyclic 

3 0 nucleotide levels in the cytoplasm. 

In a convenient embodiment, a T2R bitter taste receptor protein variant activity is measured 
by expressing a T2R bitter taste receptor protein variant gene in a heterologous cell with a 
promiscuous G-protein that links the receptor to a phospholipase C signal transduction pathway (see 
Offermanns& Simon, JBiolChem 270:15175-15180, 1995). Optionally the cell line is HEK293 
35 (which does not naturally express PTC taste receptor genes and the promiscuous G-protein is Gccl5 
(Offermanns& Simon, 1995). Modulation of taste transduction is assayed by measuring changes in 
intracellular Ca 2+ levels, which change in response to modulation of the a T2R bitter taste receptor 
protein variant signal transduction pathway via administration of a molecule that associates with a 
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T2R bitter taste receptor protein variant Changes in Ca 2+ levels are optionally measured using 
fluorescent Ca 2+ indicator dyes and fluorometric imaging. Examples of ion sensitive dyes and probes 
include, but are not limited to, bis-(l,3-dibutylbarbituric acid)trimethine oxonol (DiBAC4(3) (B-438), 
Quin-2 (AM Q-1288), Fura-2 (AM F-1225), Indo-1 (AM 1-1226), Fura-3 (AM F-1228), Fluo-3 (AM ' 
5 F-1241), Rhod-2, (AM R-1244), BAPTA (AM B-1205), 5,5'-dimethyl BAPTA (AM D-1207), 4,4'- 
difluoro BAPTA (AM D-1216), 5,5'-difluoro BAPTA (AM D-1209), 5,5'-dibromo BAPTA (AM D- 
1213), Calcium Green (C-301 1), Calcium Orange (C-3014), Calcium Crimson (C-3017), Fura-5 (F- 
3023), Fura-Red (F-3020), SBFI (S-1262), PBFI (P-1265), Mag-Fura-2 (AM M-1291), Mag-Indo-1 
(AM M-1294), Mag-Quin-2 (AM M-1299), Mag-Quin-1 (AM M-1297), SPQ (M-440), SPA (S-460), 
10 Calcien (Fluorescem-bis(memyliminodiacetic acid); Fluorexon), and Quin-2 (2-{[2-Bis- 

(carboxyme%l)ammo-5-memylphenoxy]-memyl}-6-memoxy-8-bk-(carbox^ 
tetrapotassium salt). 

In one embodiment, the changes in intracellular cAMP or cGMP can be measured using 
immunoassays. The method described in Offermanns & Simon {J Biol Chem 270:15175-15180, 
15 1995), for instance, may be used to determine the level of cAMP. Also, the method described in 
Felley-Bosco etal. {Am JResp Cell and Mol Biol 11:159-164, 1994) may be used to determine the 
level of cGMP. Further, an assay kit for measuring cAMP and/or cGMP is described in U.S. Patent 
4,115,538. 

In another embodiment, phosphatidyl inositol (PI) hydrolysis can be analyzed according to 
20 U.S. Patent 5,436,128. Briefly, the assay involves labeling of cells with 3 H-myoinositol for 48 or 
more hours. The labeled cells are Heated with a test compound for one hour. The treated cells are 
lysed and extracted in chloroform-methanol-water after which me inositol phosphates are separated 
by ion exchange chromatography and quantified by scintillation counting. Fold stimulation is 
determined by calculating the ratio of cpm in the presence of agonist to cpm in Ihe presence of buffer 
25 control. Likewise, fold inhibition is determined by calculating the ratio of cpm in the presence of 
antagonist to cpm in the presence of buffer control (which may or may not contain an agonist). 

In another embodiment, transcription levels can be measured to assess the effects of a test 
compound on signal transduction. A host cell containing a T2R bitter taste receptor protein variant of 
interest is contacted with a test compound for a sufficient time to effect any interactions, and then the 
30 level of gene expression is measured. The amount of time to effect such interactions may be 

empirically determined, such as by running a time course and measuring the level of transcription as 
a function of time. The amount of transcription may be measured by using any method known to 
those of skill in the art to be suitable. For example, mRNA expression of the protein of interest may 
be detected using northern blots or their polypeptide products may be identified using immunoassays. 
Alternatively, transcription based assays using reporter genes may be used as described in U.S. Patent 
5,436,128. The reporter genes can be, e.g., chloramphenicol acetyltransferase, luciferase, P- 
galactosidase and alkaline phosphatase. Furthermore, the protein of interest can be used as an 
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indirect reporter via attachment to a second reporter such as green fluorescent protein (see, e.g. , 
Misnli & Spector, Nature Biotechnology 15:961-964, 1997). 

The amount of transcription is then compared to the amount of transcription in either the 
same cell in the absence of the test compound, or it may be compared with the amount of 
5 transcription in a substantially identical cell that lacks the protein of interest. A substantially identical 
cell may be derived from the same cells from which the recombinant cell was prepared but which had 
notbeenmodifiedbyintroductionofheterologousDNA. Any difference in the amount of 
transcription indicates that the test compound has in some manner altered the activity of the protein of 
interest. 

10 B. Modulators 

The compounds tested as modulators of a T2R bitter taste receptor family member variant 
can be any small chemical compound, or a biological entity, such as a protein, sugar, nucleic acid or 
lipid. Alternatively, modulators can be genetically altered versions of a T2R bitter taste receptor 
protein gene. Typically, test compounds will be small chemical molecules and peptides. Essentially 
1 5 any chemical compound can be used as a potential modulator or ligand in the assays of the invention, 
although most often compounds dissolved in aqueous or organic (especially DMSO-based) solutions 
are used. The assays are designed to screen large chemical libraries by automating the assay steps 
and providing compounds from any convenient source to assays, which are typically run in parallel 
(eg., mnricrotiter formats on microtiter plates inrobotic assays). It will be appreciated that there are 
many suppliers of chemical compounds, including Sigma (St. Louis, MO), Aldrich (St. Louis MO) 
Sigma-Aldrich (St. Louis, MO), Fluka Chemika-Biochemica Analyuka (Buchs, Switzerland) 'and the 
like. 
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In one convenient embodiment, high throughput screening methods involve providing a 
combinatorial chemical or peptide library containing a large number of potential therapeutic 
compounds (potential modulator or ligand compounds). Such "combinatorial chemical libraries" or 
"ligand libraries" are then screened in one or more assays, as described herein, to identify those 
library members (particularly chemical species or subclasses) that display a desired characteristic 
activity. The compounds thus identified can serve as conventional "lead compounds" or can 
themselves be used as potential or actual therapeutics. 

A combinatorial chemical library is a collection of diverse chemical compounds generated 
by either chemical synthesis or biological synthesis, by combining a number of chemical "building 
blocks" such as reagents. For example, a linear combinatorial chemical library such as a polypeptide 
library ,s formed by combining a set of chemical building blocks (amino acids) in every possible way 
for a grven compound length «.e., the number of amino acids in a polypeptide compound). Millions 
of chemical compounds can be synthesized through such combinatorial mixing of chemical building 
blocks. g 

Preparation and screening of combinatorial chemical libraries is well known to those of skill 
in the art. Such combinatorial chemical libraries include, but are not limited to, peptide libraries (see, 
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e.g., U.S. Patent 5,010,175; Furka, Int J Pept Prot Res 37:487-493, 1991; and Houghton etal, Nature 
354:84-88, 1991). Other chemistries for generating chemical diversity libraries can also be used 
Such chemistries include, but are not limited to: peptides (e.g., PCT Publication No. WO 91/19735) 
encoded peptides (e.g., PCT Publication WO 93/20242), random bio-oligomers (eg PCT 
Publication No. WO 92/00091), benzodiazepines (e.g., U.S. Pat. No. 5,288,514), diversomers such as 
hydantoins, benzodiazepines and dipeptides (Hobbs et al, PNAS USA 90:6909-6913, 1993) 
vinylogous polypeptides (Hagihara et al., JAmer Chem Soc 1 14:6568, 1992), nonpeptidal ' 
peptidomimetics with glucose scaffolding (Hirschmann et al., JAmer Chem Soc 1 14:9217-9218 
1992), analogous organic syntheses of small compound libraries (Chen et al, JAmer Chem Soc ' 
116:2661, 1994), obgocarbamates (Cho etal. 1993 Science 261:1303), and/or peptidyl phosphonates 
(Campbell et al., J Org Chem 59:658, 1994), nucleic acid libraries (see Sambrook et al. Molecular 
Cloning, A Laboratory Manual, Cold Springs Harbor Press,N.Y., 1989; and Ausubel etal Current 
Protocols in Molecular Biology Green Publishing Associates and Wiley Interscience, N.Y 1989) 
peptide nucleic acid libraries (see, e.g., U.S. Patent 5,539,083), antibody libraries (see, e.g., Vaughn 
etal Nature Biotechnology 14:309-314, 1996; and PCT/US96/10287), carbohydrate Ubraries (see 
e.g, Liang et al, Science 274:1520-1522, 1996; and U.S. Patent 5,593,853), small organic molecule 
libraries (see, e.g., benzodiazepines, Baum 1993 C&EN, Jan 18, page 33; isoprenoids, U.S. Patent 
5,569,588; thiazolidionones and methathiazones, U.S. Patent 5,549,974; pyridines, U.S. Patents 
5,525,735 and 5,519,134; morpholino compounds, U.S. Patent 5,506,337; benzodiazepines 
20 5,288,514, and the like). 

Devices for the preparation of combinatorial libraries are commercially available (see e g. 
357 MPS, 390 MPS, Advanced Chem Tech, Louisville, KY; Symphony, Rainin, Woburn, MA- 433 A 
Apphed Biosystems, Foster City, CA; 9050 Plus, Millipore, Bedford, MA). In addition, numerous 
combinatorial libraries are themselves commercially available (see, e.g., ComGenex, Princeton, N J • 
25 Tnpos, Inc., St Louis, MO; 3D Pharmaceuticals, Exton, PA; Martek Biosciences, Columbia MD- ' 
etc.). 

C. Solid State and Soluble High Throughput Assays 

In one embodiment is provided soluble assays using molecules such as a domain such as a 
hgand binding domain, an extracellular domain, a transmembrane domain, a transmembrane domain 
and a cytoplasmic domain, an active site, a subunit association region, etc ; a domain that is 
covalently linked to a heterologous protein to create a chimeric molecule; a T2R bitter taste receptor 
protein variant/isoform; or a cell or tissue expressing a T2R bitter taste receptor protein 
variant/isoform, either naturally occurring or recombinant. Another embodiment provides solid 
phase based in vitro assays in a high throughput format, where the domain, chimeric molecule T2R 
bitter taste receptor protein variant/isoform, or cell or tissue expressing a specific T2R bitter taste 
receptor variant is attached to a solid phase substrate. It is particularly contemplated in some 
embodiments that multiple molecules are provided in such assays, for instance, a collection of two or 
more T2R isoforms proteins, or fragments thereof, such as those isoforms shown in SEQ ID NOs: 4 
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8, 10, 12, 14, 16, 20, 22, 24, 26, 28, 30, 32, 34, 36, 40, 42, 46, 50, 56, 58, 60, 64, 66, 68, 70, 72, 76, 
78, 80, 82, 84, 86, 90, 92, 94, 96, 100, 102, 104, 106, 108, 112, 114, 116, 118, 120, 122, 124, 126, ' 
128, 130, 136, 140, 142, 148, 150, 152, 156, 158, 162, 164, 170, 174, 176, 178, 180, 182, 184, 188, 
190, 192, 198, 200, 202, 204, 206, 210, 212, 214, 218, 220, 226, 228, 230, 232, 234, 236, 238^ 242^ 
244, 246, 248, 250, 252, 254, 258, 260, 264. 

In the high throughput assays, it is possible to screen up to several thousand different 
modulators or ligands in a single day. In particular, each well of a microtiter plate can be used to run 
a separate assay against a selected potential modulator, or, if concentration or incubation time effects 
are to be observed, every 5-10 wells can test a single modulator. Thus, a single standard microtiter 
plate can assay about 100 (e.g., 96) modulators. If 1536 well plates are used, then a single plate can 
easily assay from about 100- about 1500 different compounds. It is possible to assay several different 
plates per day; assay screens for up to about 6,000-20,000 different compounds are possible using the 
integrated systems of the invention. More recently, microfluidic approaches to reagent manipulation 
have been developed 

The molecule of interest can be bound to the solid state component, directly or indirectly, 
via covalent or non-covalent linkage, e.g., via a tag. The tag can be any of a variety of components. 
In general, a molecule which binds the tag (a tag binder) is fixed to a solid support, and the tagged 
molecule of interest (e.g., the taste transduction molecule of interest) is attached to the solid support 
by interaction of the tag and the tag binder. 

A number of tags and tag binders can be used, based upon known molecular interactions 
well described in the literature. For example, where a tag has a natural binder, for example, biotin, 
protein A, or protein G, it can be used in conjunction with appropriate tag binders (avidin, 
streptavidin, neutravidin, the Fc region of an immunoglobulin, etc.). Antibodies to molecules with 
natural binders such as biotin are also widely available and appropriate tag binders; see, SIGMA 
25 Immunochemicals 1998 catalogue SIGMA St. Louis, MO. 

Similarly, any haptenic or antigenic compound can be used in combination with an 
appropriate antibody to form a tag/tag binder pair. Thousands of specific antibodies are 
commercially available and many additional antibodies are described in the literature. For example, 
in one common configuration, the tag is a first antibody and the tag binder is a second antibody which 
recognizes the first antibody. In addition to antibody-antigen interactions, receptor-ligand 
interactions are also appropriate as tag and tag-binder pairs. For example, agonists and antagonists of 
cell membrane receptors (e.g., cell receptor-ligand interactions such as transferrin, viral receptor 
ligands, cytokine receptors, chemokine receptors, interleukin receptors, immunoglobulin receptors 
and antibodies, the cadherein family, the integrin family, the selectin family, and the like; (see, e.g., 
Pigott&Power 1993 The Adhesion Molecule Facts Book 1). Similarly, toxins andvenoms, viral 
epitopes, hormones (e.g., opiates, steroids, etc.), intracellular receptors (e.g., which mediate the 
effects of various small ligands, including steroids, thyroid hormone, retinoids and vitamin D; 
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peptides), drugs, lectins, sugars, nucleic acids (both linear and cyclic polymer configurations), 
oligosaccharides, proteins, phospholipids and antibodies can all interact with various cell receptors. 

Synthetic polymers, such as polyurethanes, polyesters, polycarbonates, polyureas, 
polyamides, polyemyleneimines, polyarylene sulfides, polysiloxanes, polyimides, and polyacetates 
can also form an appropriate tag or tag binder. Many other tag/tag binder pairs are also useful in 
assay systems described herein, as would be apparent to one of skill upon review of this disclosure. 

Common linkers such as peptides, polyethers, and the like can also serve as tags, and include 
polypeptide sequences, such as poly gly sequences of between about 5 and 200 amino acids. Such 
flexible linkers are known to persons of skill in the art For example, poIy(ethelyne glycol) linkers 
are available from Shearwater Polymers, Inc. Huntsville, Alabama. These linkers optionally have 
amide linkages, sulfhydryl linkages, or heterofunctional linkages. 

Tag binders are fixed to solid substrates using any of a variety of methods currently 
available. Solid substrates are commonly derivatized or functionalized by exposing all or a portion of 
the substrate to a chemical reagent which fixes a chemical group to the surface which is reactive with 
a portion of the tag binder. For example, groups which are suitable for attachment to a longer chain 
portion would include amines, hydroxyl, thiol, and carboxyl groups. Aminoalkylsilanes and 
hydroxyalkylsilanes can be used to functionalize a variety of surfaces, such as glass surfaces. The 
construction of such solid phase biopolymer arrays is well described in the literature. See, e.g., 
Merrifield, J Am Cliem Soc 85:2149-2154, 1963 (describing solid phase synthesis of, e.g., 'peptides); 
Geysen et al, JImmun Meth 102:259-274, 1987 (describing synthesis of solid phase components on 
pins); Frank & Doring, Tetrahedron 44:6031-6040, 1988 (describing synthesis of various peptide 
sequences on cellulose disks); Fodor et al. Science 251:767-777, 1991; Sheldon etal, Clinical 
Chemistry 39:718-719, 1993; and Kozal et al, Nature Medicine 2:753759, 1996 (all describing arrays 
of biopolymers fixed to solid substrates). Non-chemical approaches for fixing tag binders to 
substrates include other common methods, such as heat, cross-linking by UV radiation, and the like. 

D. Computer-based Assays 

Yet another assay for compounds that modulate taste receptor protein activity involves 
computer assisted drug design, in which a computer system is used to generate a three-dimensional 
structure of a target taste receptor protein based on the structural information encoded by its amino 
acid sequence. The input amino acid sequence interacts directly and actively with a preestablished 
algorithm in a computer program to yield secondary, tertiary, and quaternary structural models of the 
protein. The models of the protein structure are then examined to identify regions of the structure 
that have the ability to bind, e.g., ligands. These regions are then used to identify ligands that bind to 
the protein. 

The three-dimensional structural model of the protein is generated by entering protein amino 
acid sequences of at least 10 amino acid residues or corresponding nucleic acid sequences encoding a 
T2R bitter taste receptor polypeptide allelic variant into the computer system. The nucleotide 
sequence encoding the polypeptide, or the amino acid sequence thereof, can be any of the allelic 
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variant taste receptors described. The amino acid sequence represents the primary sequence or 
subsequence of the protein, which encodes the structural information of the protein. At least 10 
residues of the amino acid sequence (or a nucleotide sequence encoding 10 amino acids) are entered 
into the computer system from computer keyboards, computer readable substrates that include, but 
are not limited to, electronic storage media (e.g., magnetic diskettes, tapes, cartridges, and chips), 
optical media {e.g., CD ROM), information distributed by internet sites, and by RAM. The three- 
dimensional structural model of the protein is then generated by the interaction of the amino acid 
sequence and the computer system, using software known to those of skill in the art 

The amino acid sequence represents a primary structure that encodes the information 
necessary to form the secondary, tertiary and quaternary structure of the protein of interest. The 
software looks at certain parameters encoded by the primary sequence to generate the structural 
model. These parameters are referred to as "energy terms," and primarily include electrostatic 
potentials, hydrophobic potentials, solvent accessible surfaces, and hydrogen bonding. Secondary 
energy terms include van der Walls potentials. Biological molecules form the structures that 
minimize the energy terms in a cumulative fashion. The computer program is therefore using these 
terms encoded by the primary structure or amino acid sequence to create the secondary structural 
model. An example for G-protein cell receptors is presented in Vaidehi et at. {PNAS 99: 15308- 
153 12, 2002), which is incorporated herein by reference in its entirety. 

The tertiary structure of the protein encoded by the secondary structure is then formed on the 
basis of the energy terms of the secondary structure. The user at this point can enter additional 
variables such as whether the protein is membrane bound or soluble, its location in the body, and its 
cellular location, e.g., cytoplasmic, surface, or nuclear. These variable along with the energy terms of 
the secondary structure are used to form the model of the tertiary structure. In modeling the tertiary 
structure, the computer program matches hydrophobic faces of secondary structure with like, and 
25 hydrophilic faces of secondary structure with like. 

Once the structure has been granted, potential ligand binding regions are identified by the 
computer system. Three-dimensional structures for potential ligands are generated by entering amino 
acid or nucleotide sequences or chemical formulas of compounds, as described above. The three- 
dimensional structure of the potential ligand is then compared to that of the target taste receptor 
30 protein variant to identify ligands that bind to the protein. Binding affinity between the protein and 
hgands is determined using energy terms to determine which ligands have an enhanced probability of 
binding to the protein. 

Example 18: Pharmaceutical Preparations and Methods of Administration 
35 Taste modulators can be administered directly to the mammalian subject for modulation of 

taste, e.g., modulation of bitter taste, in vivo. Administration is by any of the routes normally used for 
mtroducmg a modulator compound into ultimate contact with the tissue to be treated, optionally the 
tongue or mouth. The taste modulators are administered in any suitable manner, optionally with 
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pharmaceutical^ acceptable carriers. Suitable methods of administering such modulators are 
available and well known to those of skill in the art, and, although more man one route can be used to 
administer a particular composition, a particular route can often provide a more immediate and more 
effective reaction than another route. 

Pharmaceutically acceptable carriers are determined in part by the particular composition 
being administered, as well as by the particular method used to administer the composition. 
Accordingly, there is a wide variety of suitable formulations of pharmaceutical compositions of the 
present invention (see, e.g., Remington's Pharmaceutical Sciences, H^ed. 1985). 

Formulations suitable for administration include aqueous and non-aqueous solutions, 
isotonic sterile solutions, which can contain antioxidants, buffers, bacteriostats, and solutes that 
render the formulation isotonic, and aqueous and non-aqueous sterile suspensions that can include 
suspending agents, solubilizers, thickening agents, stabilizers, and preservatives. In the practice of 
this invention, compositions can be administered, for example, orally. The formulations of 
compounds can be presented in unit-dose or multi-dose sealed containers, such as ampoules and vials 
Solutions and suspensions can be prepared from sterile powders, granules, and tablets of the kind 
previously described. The modulators can also be administered as part of a prepared food or drug. 

The dose administered to a patient, in the context of the present invention should be 
sufficient to effect a beneficial response in the subject over time. The dose will be determined by the 
efficacy of the particular taste modulators employed and the condition of the subject, as well as the 
body weight or surface area of the area to be treated. The size of the dose also will be determined by 
the existence, nature, and extent of any adverse side-effects that accompany the administration of a 
particular compound in a particular subject. 

In determining the effective amounts of the modulator to be administered, a physician may 
evaluate circulating plasma levels of the modulator, modulator toxicities, and the production of anti- 
modulator antibodies. In general, the dose equivalent of a modulator is from about 1 ng/kg to 10 
mg/kg for a typical subject. 

For administration, taste modulators of the present invention can be administered at a rate 
determined by the LD* of the modulator, and the side effects of the inhibitor at various 
concentrations, as applied to the mass and overall health of the subject. Administration can be 
30 accomplished via single or divided doses. 

Example 19: Kits 

Kits are provided which contain reagents useful for determining the presence or absence of 
polymorphism(s) in at least one T2R-encoding sequence, such as probes or primers specific for a T2R 
35 SNP shown in Figure 1, or a T2R haplotype allele shown in Table 7. Such kits can be used with the 
methods described herein to determine a subject's T2R genotype or haplotype, for one or more T2R 
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The provided kits may also include written instructions. The instructions can provide 
calibration curves or charts to compare with the determined (e.g., experimentally measured) values. 

The oligonucleotide probes and primers disclosed herein can be supplied in the form of a kit 
for use in detection of a specific T2R sequence, such as a SNP or haplotype described herein, in a 
5 subject. In such a kit, an appropriate amount of one or more of the oligonucleotide primers is 

provided in one or more containers. The oligonucleotide primers may be provided suspended in an 
aqueous solution or as a freeze-dried or lyophilized powder, for instance. The container(s) in which 
the oligonucleotide(s) are supplied can be any conventional container that is capable of holding the 
supplied form, for instance, microfuge tubes, ampoules, or bottles. In some applications, pairs of 

10 primers may be provided in pre-measured single use amounts in individual, typically disposable, 

tubes or equivalent containers. With such an arrangement, the sample to be tested for the presence of 
a T2R polymorphism can be added to the individual tubes and amplification carried out directly. 

The amount of each oligonucleotide primer supplied in the kit can be any appropriate 
amount, depending for instance on the market to which the product is directed. For instance, if the 

15 kit is adapted for research or clinical use, the amount of each oligonucleotide primer provided would 
likely be an amount sufficient to prime several PCR amplification reactions. Those of ordinary skill 
in the art know the amount of oligonucleotide primer that is appropriate for use in a single 
amplification reaction. General guidelines may for instance be found in Turn's et al (PCR Protocols, 
A Guide to Methods and Applications, Academic Press, Inc., San Diego, CA, 1990), Sambrook et al 

20 (hiMolecular Cloning: A Laboratory Manual, Cold Spring Harbor, New York, 1989), and Ausubel 
et al (In Cwrent Protocols in Molecular Biology, Greene Publ. Assoc. and Wiley-Intersciences, 
1992). 

A kit may include more than two primers, in order to facilitate the in vitro amplification of 
T2R-encoding sequences, for instance a specific target T2R gene or the 5' or 3' flanking region 
25 thereof. 

In some embodiments, kits may also include the reagents necessary to carry out nucleotide 
amplification reactions, including, for instance, DNA sample preparation reagents, appropriate 
buffers (e.g., polymerase buffer), salts (e.g., magnesium chloride), and deoxyribonucleotides 
(dNTPs). 

30 ^ "* addition include either labeled or unlabeled oligonucleotide probes for use in 

detection of T2R polymorphism(s) or haplotypes. In certain embodiments, these probes will be 
specific for a potential polymorphic site that may be present in the target amplified sequences. The 
appropriate sequences for such a probe will be any sequence that includes one or more of the 
identified polymorphic sites, particularly those nucleotide positions indicated in Figure 1, such that 

35 the sequence the probe is complementary to a polymorphic site and the surrounding T2R sequence. 
By way of example, such probes are of at least 6 nucleotides in length, and the polymorphic site 
occurs at any position within the length of the probe. It is often beneficial to use longer probes, in 
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SNP positions, such as those described in Figure 1. 
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embodiments, these kits contain one or more microtiter plate assays, designed to detect 
polymorphism(s) in a T2R sequence of a subject, as described herein. 

Additional components in some of these kits may include instructions for carrying out the 
assay. Instructions will allow the tester to determine whether a specified T2R allele is present and 
whether it is homozygous or heterozygous. Reaction vessels and auxiliary reagents such as ' 
chromogens, buffers, enzymes, etc. may also be included in the kits. 

It may also be advantageous to provide in the kit one or more control sequences for use in 
the OLA reactions. The design of appropriate positive control sequences is well known to one of 
ordinary skill in the appropriate art. 



This disclosure provides a comprehensive collection of SNPs inbitter taste receptor genes 
including sub-sets that represent conserved, non-conserved, silent, and truncation mutations in the ' 
corresponding proteins, and individual allelic sequences and haplotypes for bitter taste receptor 
genes. The disclosure further provides methods for using the corresponding allelic variants of the 
taste receptor genes, alone or in various combinations, to test and characterize a subject's bitter 
tasting profile and to identify and analyze compounds that interact with and/or influence bitter tastes 
m different subjects and populations. It will be apparent that the precise details of the methods 
described may be varied or modified without departing from the spirit of the described invention We 
claim all such modifications and variations that fall within the scope and spirit of this disclosure, and 
2.0 all equivalents of such. 



